Gene Haur_0730 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0730 
Symbol 
ID5732616 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp839405 
End bp840961 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content51% 
IMG OID641277860 
ProductTPR repeat-containing protein 
Protein accessionYP_001543506 
Protein GI159897259 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2956] Predicted N-acetylglucosaminyl transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.444597 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTACCA ATGTTGACAC TTTGAATGTA ATGCCGCTGG ACCTCGATAC CTTTGCTGGG 
AGCGAGCGTT TTATGGCTGG GACGCGCTTG GGTGCTGCGT TTGGGCGCGG GATCAAAGCC
TATTTGGCCG CCAATTATCC TGATGCAATC GAGCATTTCA AAACTGCATT AATCGCTGCC
TATGTTGAAG GCGATGAGAA ATCGCAAATC TACGATCGCG AGCGGGCGAT TATCTATCTG
TATATTGGCA ATGCCTGTGC TTTTCAAGAT GATTGGCCGA CGGCTCAGCG TGAGTATTTG
GAATCGGTAC AAACCGACCC GCAATTGGCC GAAGCCCACT ACAATTTGGG TGTTGCTTTC
GGCGCGATGC GCCAAATTGA TCGGGCAATT GGCGCATTCA AGGAAGCGCT TGAACATAAT
AATGGCTTGT ACGAGGCCCA TTTTGCACTT GGCCGTGCTT ATCAAATGCT CGATGATGCT
GGCCGCGCCT ATATTCACTT TACTTCAGCC CGTGAATTGC GGCCTTACGC TGGCGAGCCT
TTGTATTACA TGGGTTTGAT GCACCAAGCC CATGGTGCTC ACGAATTGGC ACAGCGCTGT
TTTGCCGAGG CGCTGCGAGT TGAGCCAACC TTTATCTCGC CTAGCGTTGG CCCCGATGAA
GTGCTGGTTA GCAAATCGGA GCAAGAGGTT GCCAATTGGT ACTATCGTTT GAGTGACGAC
CTTAAATCCC AAGGCTATGA TGAAGATGCC AAGCGGATTT ACGAGGCTTT GCTGAAATGG
CGACCAACCG AGCATCGTGC CCGCTATTTG TTGGGCAATT TGTTGGCGCG GGCGCGACGT
TGGGAATTGG CTTTGCAAGA ATATGGCCGA ATTCCACCGC AAGATCGCTA CTATGTTGAT
GCGCGACTGC GGATCAGTGC AATTTTGCGT TTTCAGGGCA AGCATCGCGA GGCCTATAAC
CATTTGTATA GTACCGCCAA GTTGCGGCCC AACGATGGTC AAGTGTTCTT GCAAATGGGC
AAATTGCTCT ATGATTTAGA GAAACATCAT GCTGCGATGC GTGCTTTTGA ACGCGCGGTC
CAACTTTTGC CCAAAGACCC TAATGCCTAC TATTTGTTAG GGTTTATGTA CACGGTGTTG
GGCCATGAAA GTTGGGCCTT GGCGGCTTGG CGCAAAGCAG TGGAGTTAGC GCCGCATGCC
CATTCATTGC GCTACGATTT GGGCTATATG TACACCCGCC GTCGCCGCTA CGATTTGGCT
TCGCGCGAAT TTAGCCATGT ACTGATGCAT TGGCCCGATG ATATTGAAAC CACCTTTATG
CTGGGCACAT GTTATAAAGA AATGTTAGAA CCAGCCCAAG CCATACCGTT ATTTGAAAAA
GTGCTGCGCC GCAACCCACG TCATACCCAA GCACTCTACT ATTTGGGGGC ATGCTATCTG
CAAGTTGGCA ACAGTTCTTT GGGTAAGGCC TACCTGCGCC GCTACGAACA TCTGATCAAC
CAACAGGAAC AATTACCACA AAAGGGCCGC GCAATGGCTC GTGGAGGTTC GCGATGA
 
Protein sequence
MTTNVDTLNV MPLDLDTFAG SERFMAGTRL GAAFGRGIKA YLAANYPDAI EHFKTALIAA 
YVEGDEKSQI YDRERAIIYL YIGNACAFQD DWPTAQREYL ESVQTDPQLA EAHYNLGVAF
GAMRQIDRAI GAFKEALEHN NGLYEAHFAL GRAYQMLDDA GRAYIHFTSA RELRPYAGEP
LYYMGLMHQA HGAHELAQRC FAEALRVEPT FISPSVGPDE VLVSKSEQEV ANWYYRLSDD
LKSQGYDEDA KRIYEALLKW RPTEHRARYL LGNLLARARR WELALQEYGR IPPQDRYYVD
ARLRISAILR FQGKHREAYN HLYSTAKLRP NDGQVFLQMG KLLYDLEKHH AAMRAFERAV
QLLPKDPNAY YLLGFMYTVL GHESWALAAW RKAVELAPHA HSLRYDLGYM YTRRRRYDLA
SREFSHVLMH WPDDIETTFM LGTCYKEMLE PAQAIPLFEK VLRRNPRHTQ ALYYLGACYL
QVGNSSLGKA YLRRYEHLIN QQEQLPQKGR AMARGGSR