Gene Haur_1096 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1096 
Symbol 
ID5732987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1255070 
End bp1256815 
Gene Length1746 bp 
Protein Length581 aa 
Translation table11 
GC content51% 
IMG OID641278234 
Producthypothetical protein 
Protein accessionYP_001543872 
Protein GI159897625 
COG category[R] General function prediction only 
COG ID[COG3889] Predicted solute binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0122577 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAATTGC GCTCATTGCG GTCGTTGTTG ACTGTTGATC GTGCTCGGGT CGGCTCATTA 
TTAAGTGTTT TATTGGTAGG CTTGTTGGCT TGGATGCTGC CGCAGACCAA TGCTCAGCCT
GCTTTCGCTG CATCAAGCAC CGTCGTGATT AGCCAAGTTT ATGGGGGTGG CGGCTCTGCT
ACGGCTACTT ACAAGAGTGA TTACGTAGAA TTGTTCAATT TGAGTGGTTC TGCTGTCTCT
TTAAATGGCT TGTCGATTCA ATATGCTTCA AGCACAGGGA ACTTTAATGG TGTTTTCGCT
TTACCAAATG CTACAATTCT ACCTGGCAAA TACTATCTCG TACAGCTATC TCTGGGTACA
GGTCTAGGCG ATATCCCAAC TCCAGATGCA GCTTCTGGAA CTAATATCGC TATGTCTGCA
ACTGCAGGTA AGGTTATTAT TGCGAATACC ACTACAGCAT TAGGGTGTTC CACAAGCGCG
ACTTGTACTC CTGCTCAACA AGCCCAAATT ATTGATCTCG TTGGTTATGG CACAGCTGCT
AATTACTTCG AAGGTAGTGG GCCAACAGGT GCGCCAAGCA ATACGACGAG CGTTATTCGT
ACCAATCCTT GTGTTGATGC CGATAATAAT GCAACTGAAT TTAGCGTGGG TACGCCAAAC
CCACGTAATA CTGCCAGCCC TACGTTGAGC TGTTCAGCGG CCACCAATAC ACCAACGAAC
ACGCCGACTA ATACCGCGAC CAACACACCA ACCAGCACTC CAATTGTGCT TGGGGGCGAT
AATAATATCC TGTGGGATCA GCTCTATCAC AGCGCCACTG CTGTAAATCC TCAACTTGAG
CTTGTGCCAA ACGAGAGCTA CAGCTTTTTG CATAGTGCTA GTGGCACAAT CGACGAAACC
ACGGCTGTGA CGATTTCGGC ATTAACTGAT GCGCTTGATG TGCAAACGGT TAGCCTGCGC
TACTGGGATG GAGCGAATTC GACTACAATT CCAATGACGA GGATTAAATC GTTGAGCGCT
AGCTTTCGCA GCCAGCCAAT CCATAGCTAC GATTTGTGGC AGGCTAGCAT TCCAGCTCAG
CCAATCGGCA CAAGCATTTT CTATCGGGTG ATTGCTCAAG ATGGTTCGGC CTCAGCCTAT
TTGAAGCACA ATAATGGCCA ATATGTGAAT CCGCTTGGCC AACATGTGCG GGGCTTCAAT
GATGATCCCG ATGATTATAG CTACACGGTT TTAGCGGCAA ACCCAACTGC TACCCCAACG
AATACCCCAA CTAACACGCC AACGGATACC GCTACGCCGA CGGCGACCAA TACGCCAACC
AATACACCAA CCGATACGGC AACGCCAACG GCGAGCAACA CGCCAACCAA TACGCCGACC
GATACGGCAA CACCAACCAA CACGCCAGTG GCTCCAACGG CAACCGATAC CGCTACGCCA
ACGGCGACGA ACACGCCAAC CAATACGCCG ACCGATACGG CAACGCCAAC CAACACGCCA
GTGGCTCCAA CGGCAACCGA TACCGCTACG CCAACGGCGA GCAACACACC AACCAATACG
GCTACGCCAA CGATCACGGT GACGAGAACA CCGACACATA CGCCAACTAA TACAGCAACG
CCAACGCGCA CGGCGACCAA CACGCCAACC AATACGGCGA CATCGACGGC GACGAATACG
CCAACCGTCA CCAATACGCC AATTGCTCAG CAGCATAAAG TGTTCTTACC ATGGGCCAGC
AAATAG
 
Protein sequence
MQLRSLRSLL TVDRARVGSL LSVLLVGLLA WMLPQTNAQP AFAASSTVVI SQVYGGGGSA 
TATYKSDYVE LFNLSGSAVS LNGLSIQYAS STGNFNGVFA LPNATILPGK YYLVQLSLGT
GLGDIPTPDA ASGTNIAMSA TAGKVIIANT TTALGCSTSA TCTPAQQAQI IDLVGYGTAA
NYFEGSGPTG APSNTTSVIR TNPCVDADNN ATEFSVGTPN PRNTASPTLS CSAATNTPTN
TPTNTATNTP TSTPIVLGGD NNILWDQLYH SATAVNPQLE LVPNESYSFL HSASGTIDET
TAVTISALTD ALDVQTVSLR YWDGANSTTI PMTRIKSLSA SFRSQPIHSY DLWQASIPAQ
PIGTSIFYRV IAQDGSASAY LKHNNGQYVN PLGQHVRGFN DDPDDYSYTV LAANPTATPT
NTPTNTPTDT ATPTATNTPT NTPTDTATPT ASNTPTNTPT DTATPTNTPV APTATDTATP
TATNTPTNTP TDTATPTNTP VAPTATDTAT PTASNTPTNT ATPTITVTRT PTHTPTNTAT
PTRTATNTPT NTATSTATNT PTVTNTPIAQ QHKVFLPWAS K