Gene Haur_4803 details

Gene Information

Locus tag: Haur_4803
Symbol:
ID: 5736648
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 6127380
End bp: 6129251
Gene Length: 1872 bp
Protein Length: 623 aa
Translation table: 11
GC content: 41%
IMG OID: 641281969
Product: hypothetical protein
Protein accession: YP_001547562
Protein GI: 159901315
COG category:
COG ID:
TIGRFAM ID:
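
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
# Sketch: check that coordinates, gene length and protein length agree.
# Assumptions (not stated in the record): 1-based inclusive coordinates,
# and one terminal stop codon that is excluded from the protein length.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(6127380, 6129251)
print(length_bp, protein_length(length_bp))  # prints: 1872 623
```

Both values match the Gene Length and Protein Length fields of this record.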


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0267485
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGTTCA AAAGGTTCGT ATGGGTTAGC CTAGGGTTGG CTGGACTCAG CTTGATGCTC 
TTGGTTGGCA ATACACTCAG TTTACGGCTA TTTGGCCAAA TTCAACAGAT TGATGTTGGC
AATTGGGGCG ACCAAAATAA TGTTGCTGGC GGCTTCGAGC AAGAACAAAA CGCCCTCGGT
GAAACCTACC GTTGGACTCA AGCCGAGGCT ACGATTCGGC TTCAGGGCTA CGGTTCGGCT
AGCTCACGCT TGCTTAGCTT AAAAATAGGT GGCGTACCAA GTAGTTTGCC AGTTACCGCA
ACCATGCAGG TTTCGACTAA TTCCGCTAGT GTGGCCTTGC CACTAACCCA AACTGCTCGT
CACTATCACC TGTTGATACC GCCAACCTAC CAGCCCGATT GGCAGGTTCG GCTCAATATC
CCAACCCAAC AAGTTTTGCC CGACCCCCGA TTTTTGGGGG TGCGGCTTGA TCATGTGCAA
ATTCAAACCC CAAGTATTGC TTGGTCAAGT GTTCATTGGC ACTTATTAAT CGTGCAATTG
GCAATTATGA GTAGCCTTAT TGGGGTATTG TGGTTTTTAG CTGCTGATTG GCCGACAATT
GTTGGTATCA GCGGCATAAC AATCTTGGTG CTGGTAACAA TCACTGGTAA ATTTGTATTG
GTGGCATGGG CATGGCAACT CCGTTTATTA ATTGTAGCTG CTGCTACAAC GCTGTTAGTT
GGCTGGCTCC GTTCATTAAT TCAGAATCAG ATTAAGCATT TACTAAAACC GTATGAATAT
CGGTATTTAA TTATATTTAG TGTTATTAGC TTTTTCATTC CAGTAATGAG TATCCTATTT
CCAAATTTTG GCTCACATGA TCGGGTAATT CATGCTGATC GTTTAGGTCA AGTTGCTCAA
GGCTCAGCAT TATTATTAGA TAAATTATAT GAATTTCAGG GCCGCGAAAC CATTACCCCA
ACCACATTTT ATCTATTAGC ACTACCACTA ACTGTATTTT TTAATAATAA TGGATTAATT
ATTGAAGGAT TGTATACATT TTTGCATGCA AGCAGTGGGA TTCTTTTGGC AATAACCTTA
TTACGCTGGA AGGTTCGGCC TATACTTGCA CTTGCGGCAA TGATTTTAAT CTCAGCAATG
CCAATTCAAA TGACAATTTT ATGGTGGGGT TTTGCCCCTC AAATTGTTGG TCAGTGGTTG
ATTCTTGTAT TTTTGGCTGT TTTTAGTTTT CAATCGACTC TTCGTCCAAC CATAATCAGC
ATTGGGATAC TCAGTTTGGC TATCTGGATG CATAATGGGG TAGCACTTTT AGCAGGAACC
TGGATCGCTA GCTATTGTGC ATTAGGCTAT TGGCGCGATC CTGCCCAACG TAGGCATTAT
TGGGCTTGGT TTTTAAACTT AATAGGAATC AGCATTTTTG GGTTGTTGGC AATTTATATT
GATCTTTTTA TGACAACAGG TACAACCCAA CAACAGGTAT TGGGCTTGAC TGAATATCTG
CCAGCAGTAA TCAATGGTTT ATCTGCAAGC TTTGCGCCAA TTGGAATTAT GTTTGTAGCT
ATTTTCGGAT TATTACCATT TCTCCAATTA GAAAAAAACA AAAAAATCTT GCTTATTGCT
AGTGGATTAA CATTTCTATT GTTTTTGGCC ATTGATATCG TGTTTGGTGT ACAGGTACGT
TATAGCTATT TTATTCTGCC ATTCCTATTG ATGATTGGAA TAATATTTAT CGATCAGCGA
TTAAGCATCA TTCCATATGT CGAATCTGTG ATTATAACGC TAACACTGCT TTGTTATGGC
TATAGTTTGT ATAGTTGGTA CGATGCAATT ATCTATGGCG TAAAGCCTAG TTTGCTTGGG
CTAACCCACT AG
 
Protein sequence
MQFKRFVWVS LGLAGLSLML LVGNTLSLRL FGQIQQIDVG NWGDQNNVAG GFEQEQNALG 
ETYRWTQAEA TIRLQGYGSA SSRLLSLKIG GVPSSLPVTA TMQVSTNSAS VALPLTQTAR
HYHLLIPPTY QPDWQVRLNI PTQQVLPDPR FLGVRLDHVQ IQTPSIAWSS VHWHLLIVQL
AIMSSLIGVL WFLAADWPTI VGISGITILV LVTITGKFVL VAWAWQLRLL IVAAATTLLV
GWLRSLIQNQ IKHLLKPYEY RYLIIFSVIS FFIPVMSILF PNFGSHDRVI HADRLGQVAQ
GSALLLDKLY EFQGRETITP TTFYLLALPL TVFFNNNGLI IEGLYTFLHA SSGILLAITL
LRWKVRPILA LAAMILISAM PIQMTILWWG FAPQIVGQWL ILVFLAVFSF QSTLRPTIIS
IGILSLAIWM HNGVALLAGT WIASYCALGY WRDPAQRRHY WAWFLNLIGI SIFGLLAIYI
DLFMTTGTTQ QQVLGLTEYL PAVINGLSAS FAPIGIMFVA IFGLLPFLQL EKNKKILLIA
SGLTFLLFLA IDIVFGVQVR YSYFILPFLL MIGIIFIDQR LSIIPYVESV IITLTLLCYG
YSLYSWYDAI IYGVKPSLLG LTH
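
The sequences above are displayed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence might be normalized and checked against the GC content field. The helper names are hypothetical, and the start-codon check only tests for ATG even though translation table 11 also permits alternative start codons.

```python
# Sketch: normalize a block-formatted sequence and run basic checks.
# All function names here are illustrative, not part of any database API.

def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def has_start_and_stop(seq: str) -> bool:
    """True if the CDS begins with ATG and ends with a standard stop codon."""
    return seq[:3] == "ATG" and seq[-3:] in {"TAA", "TAG", "TGA"}


# Usage on the first two blocks of the gene sequence only.
fragment = clean_sequence("ATGCAGTTCA AAAGGTTCGT")
print(len(fragment), fragment[:3])
```

Passing the full gene sequence to `clean_sequence` and then to `gc_percent` gives a value that can be compared with the reported GC content of 41%.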