Gene Haur_1753 details

Gene Information

Locus tag: Haur_1753
Symbol:
ID: 5733640
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2040040
End bp: 2041191
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 49%
IMG OID: 641278895
Product: hypothetical protein
Protein accession: YP_001544524
Protein GI: 159898277
COG category:
COG ID:
TIGRFAM ID:
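
The coordinate and length fields above are consistent with one another, which a little arithmetic confirms. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon not counted in the protein length. The function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2040040, 2041191)
print(length_bp)                  # 1152
print(protein_length(length_bp))  # 383
```

Both results match the Gene Length and Protein Length fields of the record.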


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0631867
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAACGAAT TCGAGCCGAT CTATCATCCT GAAAGTCGCC CGCTGAGTGT ATGGCTTGCG 
CCAAGTAGCA AACTTGCCCG TGCTGATCTG GCGCGTGAAT GCCGGATTCG TGGGCTATTG
ACCGAGCGCG ATCCTAGCCA AGCGCCGAGT GAGATTCGTG AGTCAATCGC TGCAAGTCAA
GGGGTGATAC TCTATATTGA TTCGTTGAAG GTCATGGTTA ATCAACCGCT TGGCGTTGCT
ATTCAATACC TTCAACAACA CCGCGAAATC CCATGGTGGC TTATTTGTGA TGGGGTCGAA
ATCGCCGAAG TCTATGAATC AGAAACCTTT AAGCACCTCC CATTTGAGCA ACTACAACAA
AGCTGGCAGC TCGACCAACA TAACCCAATC GGTATCGCCC AACAATTACT CAAAAGCATA
TTTCGGCGCT TTATCCGTCA AACTAATCCT TCGTGGTTGA GCCTACTGGC AGTTTCCCAA
AACCCTGCAC CTGCTGCAGG AACACATCTT GCACTTGATT GGAGCGATTG GACGAAAGAT
TATACGATCA ATCCTGCGGC TTGGCCCGCA ATTCAGCAAG CAATGCTTGA TGTTAAGAAT
GTGATTGTTG AGCAAGCGGT GAAGCGTTTG CTGATTCTGC CGCAGGTGCA TTTAACAGCT
GCAATGTTGA TTGGTGCGGT GATTAATGAG CGAGTTGCTG GGCCAGCCCA AATGTGGGTA
GCTAATAGCT TCAATAATGT CCTAACTTGG TGGGATTGTA ACGATCAAGC TACGGCTCAT
GGCATTACGC CGAAGGCAAC TGAACTCAAT CATGGACCTG ATGTGAGCAT GGAATGGGCG
ATTACCCAGC CAGCAGCAAA AGTCCAATTG CCGATTGAGC GTTATCTTGC GCAGCATTTG
ACCGATCGTG TGAAACAACG CACGCTTTAT ACCCGCGAAG GCATTGAGCA TAGCAAGCGA
GCCAGTGCGG TAGCTCAGCT GTTTCGGAGT GAGCTGCTAG CATCGCAAGC GGATGTTGTG
CATTGCTTTG CGGCAATTCC GGCAGCGTTG GCGCTATGCT TTGGCCGGAA ACTGAATGCA
TGTCCACCGA TTCAATGCTA TGAATTGAAG GGCTATGACG ATTACAAACC ATCATGGTTG
ATTAAGGCTT AA
 
Protein sequence
MNEFEPIYHP ESRPLSVWLA PSSKLARADL ARECRIRGLL TERDPSQAPS EIRESIAASQ 
GVILYIDSLK VMVNQPLGVA IQYLQQHREI PWWLICDGVE IAEVYESETF KHLPFEQLQQ
SWQLDQHNPI GIAQQLLKSI FRRFIRQTNP SWLSLLAVSQ NPAPAAGTHL ALDWSDWTKD
YTINPAAWPA IQQAMLDVKN VIVEQAVKRL LILPQVHLTA AMLIGAVINE RVAGPAQMWV
ANSFNNVLTW WDCNDQATAH GITPKATELN HGPDVSMEWA ITQPAAKVQL PIERYLAQHL
TDRVKQRTLY TREGIEHSKR ASAVAQLFRS ELLASQADVV HCFAAIPAAL ALCFGRKLNA
CPPIQCYELK GYDDYKPSWL IKA
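
The relationship between the gene sequence, the protein sequence, and the record's translation table 11 can be illustrated with a short script. This is a sketch under stated assumptions, not the pipeline that produced the record: the codon assignments are the standard ones, which table 11 shares; the `START_CODONS` set is only a subset of the alternative bacterial initiators, chosen for illustration; and `translate` and `gc_percent` are hypothetical helper names.

```python
from itertools import product

# Standard codon assignments in NCBI order (first base varies slowest, bases TCAG).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of bacterial initiator codons, read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used in the record's 10-base blocks."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an alternative start codon still initiates with Met
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above (60 bases, beginning with a GTG start).
first_line = "GTGAACGAAT TCGAGCCGAT CTATCATCCT GAAAGTCGCC CGCTGAGTGT ATGGCTTGCG"
print(translate(first_line))  # MNEFEPIYHPESRPLSVWLA
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value to compare against the GC content field.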