Gene Haur_3646 details

Gene Information

Locus tag: Haur_3646
Symbol:
ID: 5735507
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4585983
End bp: 4587464
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 51%
IMG OID: 641280795
Product: ComEC/Rec2-related protein
Protein accession: YP_001546410
Protein GI: 159900163
COG category: [R] General function prediction only
COG ID: [COG0658] Predicted membrane metal-binding protein
TIGRFAM ID: [TIGR00360] ComEC/Rec2-related protein
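
The start and end coordinates, the gene length, and the protein length in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the gene length includes a single untranslated stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4585983
end_bp = 4587464

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1482 493"
```

Both results agree with the Gene Length (1482 bp) and Protein Length (493 aa) fields.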


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGACTTT GTGGTTTTAC TGCTGGCTGG CTGCTTGGGC TGTGGTTAAA TGATAGGCTA 
CAAATAGCAT GGTATCTATA TTTTATGGCA TCAATAGCGA TTATTTTACT CATTATCTAT
ATCCGAAAAT CATGGCGGGT TATGCTGATA GCAATATATG CTGGTACTAT GCTAGGGGTA
GTACGAATGG CGGTCAGTCA AAATTCCTCA AATCTTGACG ATATTCGTCA ACAGATTGGC
ATGACAACTC GTTTAGAAGG AGTAATTGTC GGCCAGCCGC AATGGACTCC ACAGCAACAA
CGGGTGGTTT TAGCCGTCCA TGCTTATCAA GATAATCAGC AACGAGTTGC CACAACTGGC
AAGATTATGT TAACTCTTCC AGCCGAGCCA CCACGCAGTA ATGGCGAACG TTTGTTAGTC
AGCGGCACAA TCATCACGCC AACCGCTAGC CCAAATTTCG ATTATGCCGA CTATCTACGT
CGCCGCTCGA TTTATGCCAT GCTTGAGCCT GCAACCGTTG AACAAGCCTT GCCTGCAAAA
AATTCAGTCT ATCAACGCTT AATCGCGCTC AAACAGCGCT CGCAAACAAT CATCAACCAA
ACGTTGCCGC AACCGCAAGC AGCAGTTTTG GTCGGAATGT TGCTGGGAGT CAAAAGCAGC
GTGCCTCAAA CTGTGTGGGA TACCTTCAAT CGCACTGGGC TTTCGCATAT CTTAATCATC
TCAGGCTGGA ATATTACAAT TGTGGTGGCG GCGTTATTGG GTTTGGGCAA GGCCTTAAAG
CTTAGTCAAC GCCACGCCAC CATGGTGGCA ATTGGCGCAA TTGTGGTGTA TGTAGCCTTT
GTGGGAGCTA GCGGGGCTGT GATTCGCGCT GCCTTGATGG GCGCAATCGT GGCACTAGCC
CAGCCGCTTG GTCGCAAATC CGATGCTTGG GCGGCACTCG CGGCAGCAAC TTGGCTCATG
ACCCTGATCG ATCCGCACAC CTTATGGGAT TTAGGCTTTC AACTATCAGC TTTGGCCACG
GCGAGCTTGT TTGCTTGGGG CAAGCCAATT GAAGCTCAAT TGCGGCAGTG GCTGCGTTGG
CGCTGGCTCG AATGGATGAT CGAGCCATTG ACCGCAACAT TGGCCGCCCA AATTTGGACA
CTGCCGATCA TTCTGTATCA TTTTGGTAAT CTCTCGTTGA TTGCACCCGT CGCCAATGTA
CTGATTGTGC CAGTTGTGCC GTTGATTATG GCCAGCGGCG CAATGCTAGC ATGCTTGGGG
TTGTTTGGCC GTTGGTTAGC ATTGCTGGCC TTGCCAATCA CATGGGCGGC ATTAACTTGG
GTCGTTGAGG CCGCCGAATG GCTGGCCGAT TTATCTTGGG CAGCGGTCGA AATTCCTAGG
TTTGGCATGA GCTGGCTGGT GCTGGCGTAT GGCTTGAGCG TTGGGGCGAA GGCGTGGATG
GTTAACCACG AAGAACGCGA AGGACGCGAA GTAAGAATTT AA
 
Protein sequence
MRLCGFTAGW LLGLWLNDRL QIAWYLYFMA SIAIILLIIY IRKSWRVMLI AIYAGTMLGV 
VRMAVSQNSS NLDDIRQQIG MTTRLEGVIV GQPQWTPQQQ RVVLAVHAYQ DNQQRVATTG
KIMLTLPAEP PRSNGERLLV SGTIITPTAS PNFDYADYLR RRSIYAMLEP ATVEQALPAK
NSVYQRLIAL KQRSQTIINQ TLPQPQAAVL VGMLLGVKSS VPQTVWDTFN RTGLSHILII
SGWNITIVVA ALLGLGKALK LSQRHATMVA IGAIVVYVAF VGASGAVIRA ALMGAIVALA
QPLGRKSDAW AALAAATWLM TLIDPHTLWD LGFQLSALAT ASLFAWGKPI EAQLRQWLRW
RWLEWMIEPL TATLAAQIWT LPIILYHFGN LSLIAPVANV LIVPVVPLIM ASGAMLACLG
LFGRWLALLA LPITWAALTW VVEAAEWLAD LSWAAVEIPR FGMSWLVLAY GLSVGAKAWM
VNHEEREGRE VRI
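
The sequences above are displayed in blocks of 10 characters, 60 per line, so the spaces and line breaks have to be removed before a sequence is used elsewhere. The following is a minimal sketch of that cleanup, with a simple GC-percentage helper alongside it. Both function names (`clean_sequence`, `gc_percent`) are hypothetical helpers introduced here for illustration; they do not come from any particular library or from the database that produced this record.

```python
def clean_sequence(block_text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(block_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# Usage with the first displayed line of the gene sequence.
first_line = "ATGCGACTTT GTGGTTTTAC TGCTGGCTGG CTGCTTGGGC TGTGGTTAAA TGATAGGCTA "
cleaned = clean_sequence(first_line)
print(len(cleaned))  # 60 bases on a full display line
```

Applying `clean_sequence` to the full gene sequence block and passing the result to `gc_percent` gives a value that can be compared with the GC content field of the record.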