Gene Haur_2330 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2330 
Symbol 
ID5734202 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2973897 
End bp2976494 
Gene Length2598 bp 
Protein Length865 aa 
Translation table11 
GC content49% 
IMG OID641279471 
Producthypothetical protein 
Protein accessionYP_001545098 
Protein GI159898851 
COG category[R] General function prediction only 
COG ID[COG3889] Predicted solute binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCCAA GCTTTGTGCA ACAACGATTC AGGTCAGTAG TTAGTGCTCT TATCATTTTA 
ACGTTTGGTC TTGGTTCATT TAGTTGGCTG TTACAATCGG CGTTTGCCAA TACGATTGAT
GCAACACCAT ATTTAAATCC ATTAGCGCCC GATTTTGGCA TTCAAGCGGC GATTGATGCT
GCGGCTGCTC AGGGGGGCGG CACGGTGCGC TTACCCGCTG GTAGCTTTAC CCTCGAAACC
TACCTCGATC TTAAAACTGG CGTGACTTTG CAAGGGGTCG GGGCTGAGAC GATTTTAAAG
GCTGGTCGTA ACGAGCAACG GGTTTTTGTC ACGCAAACTG GCAGCAATCT TTCAACAATT
AAAGTTGCCA GCGTTACGCC ATTTCGAGTT GGTATGATCG TCTATGTCTG GCGTTCGACA
GAATTGCGCT TCTTGCCTGG CTCCTATGAA ATTATGAGCA TCAATAGTAC TAATCAAACG
ATCACGCTTG ATCGCGCGGT CAATTACCCG CTGACTGCTA ATGTTTCGCA AGTGTCGTAT
GGTTTGTACA CCAAATTGAC CGGTGCTGCC ACCCAAGGAA CCAATGTGAT CAGCGTGGCC
GATACCAGCG TGTTCAATCC AGGCGAAGGC ATCATTATTA AAGGAACTGA AGGTACAGGC
ATTGGCAATT GGGGTGTTGA GCAAAATATG GTCGATTCGA TCAATACCAG CAACAACACG
CTGACCCTCA AAAAGCCTTT GACGCTTTCA GTGCCCAATA ACTCAGTCGT ATCGCACGCC
TATTCGGCGA TTTTTGCGCT TGGAACCAAT TTCAACAATC GCTTGCAAAA TATGGGTGTA
CGCGATTTGA CGATTGAGGG CTGGAACACC AACCAAAAGC CGGCCTTCTA TGAGTTTTAT
ATTGGAGCGA TTAACTTTGT CTATTGCCGC TTTGTGACGA TCGATAACAT CACCGTGCGC
TATTGGCATA GCGATGGCGT AAGCTTGCAA TCGTGTGATC AAAGCACGGT CAGCAATAGT
CTGGCAACTG CTAATCGTGG CCATGGCTTC CATCCAGGTA CTGCTTCACG CGATATTGAA
TTTTTCAAAA TTCAAGGGAT TGGCAATTTG GGCTATGCAG CACGTGGGAC TGCTGGCGAT
GGTTTGTACT ATTGTTGGGC TAACCAACGA GTTAATATTC GCCAGAGCGT TTTCCGTAAT
AATGCTGGCT CAGGCGTTGG CGATTTAGGT GGTGGCGATA CCGATAATTC CTCACGTGAT
ACCGATAACA TCATTGAAGA TAGCATTATG GAAGGCAATT TCCGCGCTGG GATTGAGGTT
AATGGTGGTG GCAATACGGC CAATAACATT ATTCGCCGCA ACGTGATTCG CAATAACAAC
ACTGGCAATC AAGATTATGC TGGCATTAAC TTGCTTTCCA AGCGCGGCCC AGTCCAACGC
TATATCATCC AAGATAATAT TGTTGAAAAC ACAGCGGGCA GTAACCAACT CTTTGGGATT
CGTGAGGTCA ACTTGGCTGT GCCACCCACC ACCCCAGTTG ATTATCTGAC CGATTTCAAC
ACGATCACCA ATAACACGAT TTATAACCAC CCAAGCAATA ATTTGGTAGT GATTGGCCCC
AACACCGTCG CCACTGGCAA TATTTTCACT GCACCAGGCG CGGTGATCAC ACCAACGCCA
ACCAATATTC AACCAACTGC GACTGCTACA ATCGCGCCAA CCAACACTCC AACCCCAACT
GGTTCGTATG TGCCACGCTT GATTATGTAC AATGCCGATA CTGATCAAGT TATGTATGAT
CCAATTCCCA ATGGCGTGAC AATTAATTAT GCGACGCTGG GAACCCGTAA TATCAGCATT
GTTGCTCCAA CCGCGCCTTC GAGCGGGATT GGCAGCGTGC GTTTTTGGGT TGATAGCGTG
GTTTATCGCA CCGAAAGTGG TCGGCCTTAT TCAATCGCTG GCGATCAAAC CAATGGTACA
GATTTCTTGC CGATGAATCC GGCCTTGGCC CATGGAACCC ATGTGATCAA AGCAGCCACC
TACACAGGTT CAGGTGGAAC TGGCACACAA GGCACACCCT ATCAAATTGT GATTAATATT
GTTGATAGTA ATGCCACGGC TACGCCAATT CCAACCAACA CCAATACCCC TGTACCAACT
GTACCAACCG CGACCGCAAC TGCAACGAAC ACGCCAACCA ACACGCCGAC CAATACGGCA
ACCAACACGC CAACGGCAAC AGCAATTGCA ACCGCGACTA ATACGCCAAC TGAGATTGCA
ACGCCCACGG CCACGGTAAC GGAAGTGGCG ACGGCCACAC CAACCGAAAT CGCCACGATC
ACCGCAACCG CGACTGACGT TGCCACGGTC ACTGCGACAG AAATTGCTAC AGCTACGGCG
ACAGCGACGA TTACGGCAAC ATTAACGAAC ACACCAACCA ATACACCGAC TAATACTGCA
ACTGCAACGC TGACTGAGAC ACCTACGGCG ACGCTTGAGC CAAGTGTTAC ACCAAGCAAT
ACACCAACGG CGACAACCAC GGTTACGACT CCAGTGCCGT CAACCCATCA TGTTTATGCG
CCATGGGTTA CCAACTAA
 
Protein sequence
MQPSFVQQRF RSVVSALIIL TFGLGSFSWL LQSAFANTID ATPYLNPLAP DFGIQAAIDA 
AAAQGGGTVR LPAGSFTLET YLDLKTGVTL QGVGAETILK AGRNEQRVFV TQTGSNLSTI
KVASVTPFRV GMIVYVWRST ELRFLPGSYE IMSINSTNQT ITLDRAVNYP LTANVSQVSY
GLYTKLTGAA TQGTNVISVA DTSVFNPGEG IIIKGTEGTG IGNWGVEQNM VDSINTSNNT
LTLKKPLTLS VPNNSVVSHA YSAIFALGTN FNNRLQNMGV RDLTIEGWNT NQKPAFYEFY
IGAINFVYCR FVTIDNITVR YWHSDGVSLQ SCDQSTVSNS LATANRGHGF HPGTASRDIE
FFKIQGIGNL GYAARGTAGD GLYYCWANQR VNIRQSVFRN NAGSGVGDLG GGDTDNSSRD
TDNIIEDSIM EGNFRAGIEV NGGGNTANNI IRRNVIRNNN TGNQDYAGIN LLSKRGPVQR
YIIQDNIVEN TAGSNQLFGI REVNLAVPPT TPVDYLTDFN TITNNTIYNH PSNNLVVIGP
NTVATGNIFT APGAVITPTP TNIQPTATAT IAPTNTPTPT GSYVPRLIMY NADTDQVMYD
PIPNGVTINY ATLGTRNISI VAPTAPSSGI GSVRFWVDSV VYRTESGRPY SIAGDQTNGT
DFLPMNPALA HGTHVIKAAT YTGSGGTGTQ GTPYQIVINI VDSNATATPI PTNTNTPVPT
VPTATATATN TPTNTPTNTA TNTPTATAIA TATNTPTEIA TPTATVTEVA TATPTEIATI
TATATDVATV TATEIATATA TATITATLTN TPTNTPTNTA TATLTETPTA TLEPSVTPSN
TPTATTTVTT PVPSTHHVYA PWVTN