Gene Haur_5245 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5245 
Symbol 
ID5737203 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009974 
Strand
Start bp13830 
End bp15752 
Gene Length1923 bp 
Protein Length640 aa 
Translation table11 
GC content64% 
IMG OID641282409 
Productpeptidase domain-containing protein 
Protein accessionYP_001548000 
Protein GI159901755 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.99719 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCGCA CCCTTGTGCG GCGCTGGCCC ACGATGAGCA TGCTGCTGAT CGTGTGGTTG 
GTGCTGCTCA CGACGACCGC CCGCGCATCC ACGCCGCTGG CCGTTGCATT CAATGTGTCC
GAAGCAGGCC ACGGTGAGGC CGTGACGTTA ACGATCACCG CCGCTGTGCC GTTGGCCACG
CTGATCACGC TCGACCTTGA CCTCAGTTTG ACCGTGACGG CCTCGGCCAG CGATTGCCGC
CGTGTCGCGG CCCAGTGGCA GTGCCGCACG AGCGACAGCC CATGGATCGG GGTGTTTGCG
GTGGCGTTTG ACCCCGCGAC CCCCGCCGGA ACGGTGGCAA CGGCCCATGT CACCGCGCCC
AATGCCAGCG CAACGGCGCA GCTGGTGGCG GTGGGGATTC CGGCAGCCAC GGCGACCGCG
ACGACCACCG CCACCAGTCA ACCAAGCGTA ACGGCGACGA GTCCAAGCGG CGCGACCGAA
ACCCCGTGGC TCACGGCTAC GCCATGGCTG ACCCCGACCC CCATCCTCCC CAGCGCTGAG
CCAGTGCCGG ATAGTGCTGA GCCAAACAAT GAGGCAGGTC GCGCCACCCC GTTGGGTGTG
CCCATCACGC TGGATAAACT CTCGTTTTGG CCGCTTGGGG ATGTGGATTA TTTCGCGGTG
CAGGTGAAGC CGAGCCAAGC CGGATTGACC TTAACCATTA ATACCTATCT GACGGTGGGC
TTGGATACCC AGCTGCGATT GCTCACGCGT GATGGAGCCG AGGTCGCGAG CAATGACGAT
GTTGGGCCAA CCGATCCACG TTCGTCACTG GCCATCCGCG CGGAAGCCGG AACCTATCTG
CTCGAAGTGC GGAACGTCGC CCCGACGCAT CCAGCCTTCA AAACCTATCG CTTGGAGGTG
GCGCTGATGA TGGCTCCCAC CGCCGCGCCA ACCACCCCAC CCGAAGCCTC GCCCGCAGCG
GCCCCATGGG ATAGCTACCA CGGCAACTAC CAGTGGGACA CCGCCGCCTT GATTCCGATT
GGCGACACGG TGGAGGGCTT GACCTTTGGC TGTCCTGACT ACACCTATTT GGATCTGGAT
AGCTGCACCG TGCCCGACTT TTTCACGATC AGTGTGAAGG GCGGGCTGTG TTATTCGGCC
AACACCACGG TTGCAGCAGG GGTTGATACC AACTTAATTG TCTATGGCCC CGACCGCGAC
ATGGCCGCAC CGTGGGCTGG CAACGACGAC GCAGCCCCCG ACGAGGCAGG CAGCACAGTG
CTGTTTTGTG TGCCGGAAGC GGCAGGCGTG TTGGATGCCT ATCTGCTGAT TGGCCAGGTT
GGACGGTTGG CCCCACCGCT GAACGAACGC TCGTACACGC TCAGGGTTGA CCGTTGGCTA
CCGACCACGG CCACGCCCAC GGCCACGGCG ATCCCCATGG GCACAGGCGG CACGGCAGGC
GGTAGCGTCC CAAGCAACGG TGGTGATGGC ATGGGCGGTG TTCCGAGTGG TAGCAATCCG
CAGCCCCAGC CTGCCGCGCC GCAACCAACG ATGAGCGAGC GCGATACGCC CTTGGCAGGG
ATTCAGATTG AAGAAATCCC GATCGCGCAA GCGGCCCAAC CCACGGCGCA GCCGACCATC
CTCGTGCCCC TCAGTGTCGT GGTCTGTTAC GACGGGGTGC ATGCCAATAA AAGCTGTGAC
ATCGACGAAG GGGTGGCAGG TGTGACGGTC TACGTGACTG ACGAGCAGAG CGGAACCATC
CTCGCCCAAG CCGTGACCGA CCCCAGTGGA CGGGCAGCGA TGAGCGTGCG GGTGCACGAC
ACCGCCATGC TGATTGTGAG CATGCCGAGC TTCGATGCGA CCCAACGGGT ATCCGCCCGC
ATGCCGCAGG TCAAGCCGAT CATGGTGTCC ACCGTGACCC CACTACCCGC GCTCTTACCG
TGA
 
Protein sequence
MTRTLVRRWP TMSMLLIVWL VLLTTTARAS TPLAVAFNVS EAGHGEAVTL TITAAVPLAT 
LITLDLDLSL TVTASASDCR RVAAQWQCRT SDSPWIGVFA VAFDPATPAG TVATAHVTAP
NASATAQLVA VGIPAATATA TTTATSQPSV TATSPSGATE TPWLTATPWL TPTPILPSAE
PVPDSAEPNN EAGRATPLGV PITLDKLSFW PLGDVDYFAV QVKPSQAGLT LTINTYLTVG
LDTQLRLLTR DGAEVASNDD VGPTDPRSSL AIRAEAGTYL LEVRNVAPTH PAFKTYRLEV
ALMMAPTAAP TTPPEASPAA APWDSYHGNY QWDTAALIPI GDTVEGLTFG CPDYTYLDLD
SCTVPDFFTI SVKGGLCYSA NTTVAAGVDT NLIVYGPDRD MAAPWAGNDD AAPDEAGSTV
LFCVPEAAGV LDAYLLIGQV GRLAPPLNER SYTLRVDRWL PTTATPTATA IPMGTGGTAG
GSVPSNGGDG MGGVPSGSNP QPQPAAPQPT MSERDTPLAG IQIEEIPIAQ AAQPTAQPTI
LVPLSVVVCY DGVHANKSCD IDEGVAGVTV YVTDEQSGTI LAQAVTDPSG RAAMSVRVHD
TAMLIVSMPS FDATQRVSAR MPQVKPIMVS TVTPLPALLP