Gene Haur_3760 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3760 
Symbol 
ID5735624 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4729624 
End bp4730826 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content57% 
IMG OID641280912 
Producthypothetical protein 
Protein accessionYP_001546524 
Protein GI159900277 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAACTC AGAAAAAATT GCTCGCTTCA CGGCCTCAGA TAGCCTTAGT TAAAATGAGC 
GAACAATCTG TGCAACAACG AATAACCGGC GCTTTAGCCG ATGAAATAGG TGCGCGTCCG
GCAACCTCGA CCGCCGAGGC TCGTGCCGCC GCCGTGATCG CCGCTCAGAT GCGTCAAGTT
GGCCTCGAAG TTGGCGTGCA AACTTTCTCC GCCGCTGCTG CGCCAACCGC AGGCTTGGGC
TTATTGGCAG CGATCGGCTT GTTGGCGCTC GGCTTAGGCT GGTGGTTTCC CTATCCTTCA
GTCGCCTTGA TTGCGCTATT ATTGCTGTTG GCAGGCCGCG AATTGCATGG CCCGCCCGTT
TTGGCTGGCT TGTTGCGCCA ACGCCCAAGC CAAAATGTGA TTGGCACACG GGCGGCAACC
CGTATTCCTC GTGCTCGCAT CGTTTTATTA TGTCACCTTG ATTCGCCGCG CATGCTCTCG
CCACGCCGTG CCAGTTGGCT GCGGGTTTGG TTACTAACGA TTCCATTTGG ATTTAGTTTG
AGCCTGATTG CGCTTGGGGT AGCGATTTTT CTACCTGCTT GGCATAGTGT GCTGTTAATT
CCAGCATTCT TTTTGTTGCT CAGTTTGCTG GTGGTGATTC GGCGTGAGTG GAAGGCCGAT
TGGACGGTTG GCGCGGTCGA TGCTGCTGCC GTTGGCACGG CAATCGCGTT GGCAGCCGAT
TGGCCGCAAC GTGAAGATGT AGAACTATGG GTCGTGGCGC TGGGCGCAGG GGCCGCCGCT
GGTTCGGGTA TTCAAGCCTT ATTGAATACC TATCCCTTCC CCAAAGCCGA AACGTGGTTT
GTTAACCTGC CGTGGCTGGG CCGTGGCAAC CTCACAATTG TGGCTGGAGA AGGATTGTGG
CGCGAACGAA AGCCCGATCC TCAACTTACC AAAATGTTCC ACGAACTACA ATCCGCCAGC
GCCCCACTGA TTCGCTCGGC CTATCGGGGC GAACGCTTGG ATAGCGCTCG GCTTTTGGCT
ATGGGCTATC ACGCGGTCAG CGTGGTTGGC TTGAAATCCG ATGGCACGGC AGCGGGCTTC
CGTCAACCAG ACGATGAAAC CCGCTTACTT TCCGTGCCAC AGATGGAATT GGCCTTACGG
GTGTTGCGGC GGGTGCTCGA CCGCTTTGCT CGCAGTCATA GCAACGAGCC GCAATTACCC
TAA
 
Protein sequence
MATQKKLLAS RPQIALVKMS EQSVQQRITG ALADEIGARP ATSTAEARAA AVIAAQMRQV 
GLEVGVQTFS AAAAPTAGLG LLAAIGLLAL GLGWWFPYPS VALIALLLLL AGRELHGPPV
LAGLLRQRPS QNVIGTRAAT RIPRARIVLL CHLDSPRMLS PRRASWLRVW LLTIPFGFSL
SLIALGVAIF LPAWHSVLLI PAFFLLLSLL VVIRREWKAD WTVGAVDAAA VGTAIALAAD
WPQREDVELW VVALGAGAAA GSGIQALLNT YPFPKAETWF VNLPWLGRGN LTIVAGEGLW
RERKPDPQLT KMFHELQSAS APLIRSAYRG ERLDSARLLA MGYHAVSVVG LKSDGTAAGF
RQPDDETRLL SVPQMELALR VLRRVLDRFA RSHSNEPQLP