Gene Haur_3409 details


Gene Information

Locus tag: Haur_3409
Symbol:
ID: 5735270
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4295411
End bp: 4296748
Gene Length: 1338 bp
Protein Length: 445 aa
Translation table: 11
GC content: 51%
IMG OID: 641280556
Product: hypothetical protein
Protein accession: YP_001546173
Protein GI: 159899926
COG category:
COG ID:
TIGRFAM ID:
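
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `span_length` and `protein_length` are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 4295411
END_BP = 4296748
GENE_LENGTH_BP = 1338
PROTEIN_LENGTH_AA = 445


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the listed gene length (1338 bp) matches the coordinate span, and 1338 / 3 = 446 codons minus one stop codon gives the listed 445 aa.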


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTATCAG CGGAATTTAG CGCGAGTAAC GACCTCATCC AAGCGATCGA TCAACTGTTG 
CTGGCAGGCT TTACCAATCT CAATCATAGC CATCAAATGA GTTTGCAGCG GGTGGCTCAA
GTTTATCGCG ATACGCCGTT TGAGCAGGCG ATGCTCAGCG CTGTGACTGA TTTGAGCAAT
GGTGTGTTTC AACCAGCCGC GTTTATGTTG CTGGCGGTTG CCCGCGCATC GCTGCAAGCT
GCCCAGCATG ATCAACTTTT GCAGCAAATT CGGGTGCAAC TTGGTCGCCC AATCAATGAT
CAATCAACAT CAAAAGCCGT GGCGCTTGCC GCAACTCCGC CCTTGTTGGG TAGCGTGCAG
CATTGGCTGA CCGATTTAGC AGTAATGGGC TTTTCGCGAC TGGAGCCAGC GATGATCAAC
GCCTTTACGC CAACCCTCGC CCAACTGCAA ACTAACCCCG ATTATCTGCG CACTTCGGCG
ATTCTCTCTG GTTGGCTGCA TGAATTGCAA CTTCAGCCTG AACAATTGCC TTTGTTTCGT
TGGGGCGATT TGTGGACGCG GGCCATGCTT TCGACCCTCA GCCTGAATTC AACCCCACCA
ACCCAACCTG TCAGTGGCAC GCTGTACCCC TTGGGCATCG AATGGCGACA ACATTCCACG
CTAGTTAGTT TGGTGGTCTA TGCTGTGCTT GAGGCCGATT CCAAGGCTAG CTTGGCTACA
ATCAGCCAAT CGGCCTATAA AGTTGCGGCA ATTCAGCAGG ATCAATTGTG GTTGCTGTTT
CCTGAGTTGA GCCTGTTGTT CGATAGCTTG AGCACAGCCA AAGCCTTAGT GTTGCGCGAT
GCAGCAAGCT TGCCAACTGG CCAATTGCTG TGGGATGCGC AGACCGCCAG TTTGGGCGCT
AAATATGATT TGCTTGATGT GGCCGAACGC TATTTTGGCC TGAATCCCAA ACAATCAATT
GCCCAAGCTC AACTTGCCCC CCAACAACGC CACCCAGTCC ATGTGGCCGA GCCAGTTGTT
TTGAGCAACT ATCAACTCAA TCAAACTACC GAAGCTTGCA CAATAACGAC TGGTGAACAC
AGTTTCATGC TTGAGCTTGG GTTGATCAAC GGCACTGAAA TCGATTTGGC GGTGCTTGAA
TCGGCTCAAC GCTTGTTTGG CTTGGTGCGC TACGATGCTG GCGAGTGGCT GCTGCGACCT
TTGGCAACCA GCCTGAAAAA AGGCAAGCCG CTATTTATTG GGCTAGAAAA CGGCAAAGTT
TTTAAGAAAG CGCCTAAAAA TAATGCTGTT GGCATTCTCA AAGAGCGTGC TAGCCGCTTG
CTCAGGGAGA AATCATGA
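
The gene sequence is printed in blocks of 10 bases, so it has to be stripped of whitespace before any analysis. As a rough sketch of how the GC content field could be recomputed from such a listing (the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the exact rounding used by the source database is not stated in this record):

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    if not seq:
        raise ValueError("sequence is empty")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence above; paste the full listing
    # here to recompute the value for the whole gene.
    listing = """
    ATGCTATCAG CGGAATTTAG CGCGAGTAAC GACCTCATCC AAGCGATCGA TCAACTGTTG
    """
    seq = clean_sequence(listing)
    print(len(seq), round(100 * gc_fraction(seq)))
```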
 
Protein sequence
MLSAEFSASN DLIQAIDQLL LAGFTNLNHS HQMSLQRVAQ VYRDTPFEQA MLSAVTDLSN 
GVFQPAAFML LAVARASLQA AQHDQLLQQI RVQLGRPIND QSTSKAVALA ATPPLLGSVQ
HWLTDLAVMG FSRLEPAMIN AFTPTLAQLQ TNPDYLRTSA ILSGWLHELQ LQPEQLPLFR
WGDLWTRAML STLSLNSTPP TQPVSGTLYP LGIEWRQHST LVSLVVYAVL EADSKASLAT
ISQSAYKVAA IQQDQLWLLF PELSLLFDSL STAKALVLRD AASLPTGQLL WDAQTASLGA
KYDLLDVAER YFGLNPKQSI AQAQLAPQQR HPVHVAEPVV LSNYQLNQTT EACTITTGEH
SFMLELGLIN GTEIDLAVLE SAQRLFGLVR YDAGEWLLRP LATSLKKGKP LFIGLENGKV
FKKAPKNNAV GILKERASRL LREKS
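
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is illustrative only: it builds the codon table from the amino acid string that NCBI lists for translation table 11 (which has the same codon assignments as the standard code), and it does not handle alternative start codons or ambiguous bases. The names `CODON_TABLE` and `translate` are assumptions made for this example.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycle T, C, A, G).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA listing codon by codon up to the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    residues = []
    # Ignore any incomplete trailing codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate("ATGCTATCAG CGGAATTTAG CGCGAGTAAC"))  # MLSAEFSASN
```

The first ten codons give MLSAEFSASN, which matches the start of the protein sequence above; the final codons CTC AGG GAG AAA TCA TGA give LREKS followed by a stop, which matches its end.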