Gene Haur_4347 details


Gene Information

Locus tag: Haur_4347
Symbol:
ID: 5736207
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5554541
End bp: 5556433
Gene Length: 1893 bp
Protein Length: 630 aa
Translation table: 11
GC content: 53%
IMG OID: 641281508
Product: hypothetical protein
Protein accession: YP_001547107
Protein GI: 159900860
COG category:
COG ID:
TIGRFAM ID:
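
The coordinate and length fields above are related by simple arithmetic. The sketch below checks that relationship under two assumptions that the record does not state explicitly: that Start bp and End bp are 1-based inclusive positions, and that the gene length counts the terminal stop codon (the gene sequence in this record does end in TAA). The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose length includes the stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length // 3 - 1


length = gene_length_bp(5554541, 5556433)
print(length)                     # 1893
print(protein_length_aa(length))  # 630
```

Both values agree with the Gene Length and Protein Length fields reported for Haur_4347.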


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.954065
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTAGTTG ATACGATGAC GATCTTCAGC CAGCGTCGCC AGTGGTTGAC GCTGGTTACG 
TTATTAATTG GGCTTTTAAT TGGTTTGGGC TTTGTATCCA ATCGCCAATG GCAAGTGCCG
CTGAATCTGG GCTTGGAAGA GGGCATCAAC AACGATGCTC CATTTGTGGT TGGCTTTAAT
GCTGGCGAAC AATTGCCCGA TCGTTCAGCT CGTTACCGTT GGTCAACCCT GAATGCCCAA
TTGCGCTTTC CGCATGTGCC GCAACGCCAC TACTGGCTCG ATTTGCCACA ACTTACGGGC
AACCCAGCCA GCCTCATCAT GGGAACCAGC CAATTTACCA GTACTGCCGG ACGGGTATTG
CATGTGTTGT TGCCTGCCGA TGCCGCTGGT AAAGTGGCAA TTACGGTTGC TCAACCAGTG
GTGAGTGATG ATCCACGCGA GCTCGGGGCG GCGTTTAGTG GTGGGCAATT AAGCAGCAGT
GGTTGGGCTT GGCCTGGGCT CTATCCGACC TTAGCTTGGT TGTTTTTGCT GACGACACTT
GGTTTGAGCA TAATCTGGCT TGGTGGTTCG AGCCTTGAAG TTGGCTTGGC AGTTGGCCTA
ACAGGCTTGG GGATTGTGGC GGCAACTTGG TTTGCGCCAT TACGGGCAAG CTATGCCGCA
CCAGCGGTTG CCCAAACAAC GCTCTATGGT TTGGCTGCAT TGCTGTTGTT GGGCTGGGCC
TTGCCGCCAA TCTTGCAGCG CCTAGGCTTA ACGATCAGCC GCGATGTCTT GCGTTGGCTG
ATTTTGGCGA CGGTGCTGGT CTGGTCGCTC AAATTGGGTG GACGCTTGCT GCTCGAGCAT
ATGCCAGGCG ATATTGGCTT TCATCGCAAC CGAATTCATG CAACCAACCT TGGCGATTTA
TTCCGACCAT CGCGCCATCG CGGCATCGAT TTTCCCTATC CGCCAGTGTT GTATGCCTTG
TTGCAGCCAC TGACCTTGAC TGGAATTTCA GCCGATTGGT TGTTGCAATT AACCGCCGCA
GCCTGTGAAG CCCTGGCGAT TCCGGTGCTA TTTTGTTTGG GCTTGCGCAC AACTGGCTCA
AGCCGTGGAG CGCTGGTTGG CGCGATTATG TATGGGCTTG TGCCAGCAGG CTTTATGACC
AACGCATGGT CGTTTGATTC GCACATTTTT AGCCAATTTG TGGCCTTGCT ACTGGCAACA
TTTATGGTTT GGACGTGGCA ACACTGGCAT GAACGACGTA ACTGGCTTTG GATAACTTTG
GGTTTGAGCA CGATTGCGCT CGGACACTTT GGCTTTTATC TTAATACTGG CTTGATGGGC
GGCTTGCTGA TGCTATGGCT GTGGTGGCGC GGCCCACGTT CGCAAGGCTG GGCCTTATTC
ACCAGCTTGG TAGCAACCCA AGTGATTGTT TGGGCTTTGT ATTACTCAAG TTTTATTGGG
CTATTTTTGC AGCAAGGCCA ATCGTTTGCC GAAGGCGGCA TGAACGCAGT CAATCAGCGC
GAAGCTGTAC CACGCCTGCA ACTCCTGTGG GATATGATTG ATCTTGGGTT TTGGCGGCAT
TATGGTTTAC TGCCTGTCTT AATCGCGCCA TTTGGTTGGT GGCTCAGTCG CAAACATCGC
GGCTTGCAAT TGGTCATGGG GGCGACGTTC GTCGTTAGTT TGATCTTGGC GGCCTTTCCA
ATTATCAATG GATCGACCAT CACCACCCGT TGGCTAATGT TTAGCGCTTG GGCGATTGCC
TTGGCCACTG GCATTGCCCT CGATTGGTTG TGGCAACGCA CGCGTTGGGG TCGCTGGCCT
GCGATTCTGA TTACTAGTGG TTGTGCGATT TTTGGCATGA TCGTCTGGTT TGCTGCGATG
GTCTATAAAA TTCGCCCACC GGAACCGTTT TAA
 
Protein sequence
MVVDTMTIFS QRRQWLTLVT LLIGLLIGLG FVSNRQWQVP LNLGLEEGIN NDAPFVVGFN 
AGEQLPDRSA RYRWSTLNAQ LRFPHVPQRH YWLDLPQLTG NPASLIMGTS QFTSTAGRVL
HVLLPADAAG KVAITVAQPV VSDDPRELGA AFSGGQLSSS GWAWPGLYPT LAWLFLLTTL
GLSIIWLGGS SLEVGLAVGL TGLGIVAATW FAPLRASYAA PAVAQTTLYG LAALLLLGWA
LPPILQRLGL TISRDVLRWL ILATVLVWSL KLGGRLLLEH MPGDIGFHRN RIHATNLGDL
FRPSRHRGID FPYPPVLYAL LQPLTLTGIS ADWLLQLTAA ACEALAIPVL FCLGLRTTGS
SRGALVGAIM YGLVPAGFMT NAWSFDSHIF SQFVALLLAT FMVWTWQHWH ERRNWLWITL
GLSTIALGHF GFYLNTGLMG GLLMLWLWWR GPRSQGWALF TSLVATQVIV WALYYSSFIG
LFLQQGQSFA EGGMNAVNQR EAVPRLQLLW DMIDLGFWRH YGLLPVLIAP FGWWLSRKHR
GLQLVMGATF VVSLILAAFP IINGSTITTR WLMFSAWAIA LATGIALDWL WQRTRWGRWP
AILITSGCAI FGMIVWFAAM VYKIRPPEPF
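
The sequences above are laid out in space-separated blocks of 10 characters, 60 per line. A minimal sketch of how such a listing could be joined into one contiguous string and checked against the GC content field follows. The helper names are hypothetical, and the GC definition used here, (G + C) / total length, is an assumption about how the reported percentage was computed.

```python
def clean_sequence(block_text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (assumed definition)."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the listing above.
first_line = "ATGGTAGTTG ATACGATGAC GATCTTCAGC CAGCGTCGCC AGTGGTTGAC GCTGGTTACG"
seq = clean_sequence(first_line)
print(len(seq))               # 60
print(seq.startswith("ATG"))  # True
print(round(gc_fraction(seq) * 100))
```

Passing the full gene listing through `clean_sequence` should give a 1893-character string, which can then be compared with the reported GC content.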