Gene Haur_3144 details

Gene Information

Locus tag: Haur_3144
Symbol:
ID: 5735016
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3971342
End bp: 3972625
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 46%
IMG OID: 641280287
Product: hypothetical protein
Protein accession: YP_001545909
Protein GI: 159899662
COG category:
COG ID:
TIGRFAM ID:
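
The gene and protein lengths in this record follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(3971342, 3972625)
print(length_bp)                  # 1284
print(protein_length(length_bp))  # 427
```

Both results agree with the Gene Length and Protein Length fields of the record.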


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.613992
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCAACCA ACGTGCCTCA ACTCAGTTTT AATATCAATC TCCAAGCTGA TGAACGGGTC 
AAGTTGATCG CCTATCGCCA TTGGCTAATC TTGGCACGCA ATCTGTCAAT TTGGTTTACC
TTTTGGTTTA TTTGTACCGT GATTTTCTTG TGGCGGGTCT CGGTGGTTGG CGTTGATACC
CTGAATACTG TGCTGTTTGG CGCTGGTACG ATCTTTTTCG GGGCCATGAT TTATAGCTAT
TTCGATTGGC GCAATGATGC CTTGGTCGTT ACCAATCAGC GGGTTATTTC GTATAACTCG
CGTTTTTTGA TCAGCGTGCA GCGCAATGAG CTGTATGTGC GCGAAATTGA AGATGTAAAA
ACCGTCACCG AGTCGGTGGT TTCACGCTAT TTCGATTATG GCGAAATTGA AGTCCAAACT
GCCAGCCGTT TGCGCAACAT TGCCTTCGTG GGAATCGTCA ATCCTACGTT GGTACGTGAC
ACAATTCTCG AATTTGTTGC GCCGCTCAAA GAAGTTGAGC ATGTTGAGCA TATTCAGCAA
ATCGTCAGAG CCAAAGTGCT CAAACAAGGC ACAATGCCAA GCTTGCCACC ATTGAGCGAT
TTTATTCCAC CAGAACAAAC TGGCCGTACA CTTTTGGGCA TTATTCCGCC AAGCCCCGAG
GTGCGCGGCA ATTCGATTAT TTGGCGCAAA CATTGGCTTT TTCTCTTTGT CGAGGCAGCT
AACCCAATTT TGCTGTTTCT AATTATCAAT TTGTCGTGGT CACTGTTGCT CGGCTACGAT
TTTATTCGGT CTGGTGGCTC GTTAGTATTT TTGGCAATCC TCGATATCTT TTGTCTCGGC
TGGCTGATTT ACGAGGTGAT TGATTGGCGC AACGATGAAT ATATTGTCAC CCCAATCAAC
ATTATTGATA TTGAGCGCAA GCCCTTGGGC CGTGAAACCA AACGGGAAAC AACCTGGGAC
AAAATTCAGA ATGTTTCGCT GAATCAAGAA AATTTATGGG CACGCATTTT GAAATACGGC
GATGTCGAGC TATTTACCGC AGGTCAAAAC GAAAATTTCA CCTTTCGCGG GGTAGCCGCA
CCCGATAGCG TGTTGGCAGT CATTTCGGAT TATCGCGACC AATTTGAGCA GCGGGCGCGT
GACCGTGAAT TTGATAGCAC CTTAATGCTG TTGCAACATT ATCACCAACT ACAACGCGAT
GAACTCCAAG TGCTGTTTGA TGATCATCGC AGCCATATCG AAGCCAAATT GCCGCCAACC
GAGCGGTTGG AAACTGGAGT GTAA
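
The gene sequence is displayed in groups of 10 bases, 60 per row. As a minimal sketch, the helpers below strip that layout and compute the GC percentage of the resulting string. The names `clean_sequence` and `gc_percent` are hypothetical, and only the first row of the record is used here for illustration; pasting in the full block would let the result be compared with the Gene Length and GC content fields.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence above (six groups of 10 bases).
first_row = "ATGCCAACCA ACGTGCCTCA ACTCAGTTTT AATATCAATC TCCAAGCTGA TGAACGGGTC "
seq = clean_sequence(first_row)
print(len(seq))         # 60
print(gc_percent(seq))  # GC percentage of this row only, not of the whole gene
```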
 
Protein sequence
MPTNVPQLSF NINLQADERV KLIAYRHWLI LARNLSIWFT FWFICTVIFL WRVSVVGVDT 
LNTVLFGAGT IFFGAMIYSY FDWRNDALVV TNQRVISYNS RFLISVQRNE LYVREIEDVK
TVTESVVSRY FDYGEIEVQT ASRLRNIAFV GIVNPTLVRD TILEFVAPLK EVEHVEHIQQ
IVRAKVLKQG TMPSLPPLSD FIPPEQTGRT LLGIIPPSPE VRGNSIIWRK HWLFLFVEAA
NPILLFLIIN LSWSLLLGYD FIRSGGSLVF LAILDIFCLG WLIYEVIDWR NDEYIVTPIN
IIDIERKPLG RETKRETTWD KIQNVSLNQE NLWARILKYG DVELFTAGQN ENFTFRGVAA
PDSVLAVISD YRDQFEQRAR DREFDSTLML LQHYHQLQRD ELQVLFDDHR SHIEAKLPPT
ERLETGV
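
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the usual compact 64-letter string (bases ordered T, C, A, G). It assumes that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted, so start-codon handling is not modelled. The function name `translate` is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG; '*' marks a stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence in this record.
print(translate("ATGCCAACCA ACGTGCCTCA ACTCAGTTTT AATATCAATC TCCAAGCTGA TGAACGGGTC"))
# MPTNVPQLSFNINLQADERV
print(translate("GAGCGGTTGG AAACTGGAGT GTAA"))
# ERLETGV
```

The two outputs match the first 20 and the last 7 residues of the protein sequence shown above.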