Gene Haur_2980 details


Gene Information

Locus tag: Haur_2980
Symbol:
ID: 5734852
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3760289
End bp: 3762613
Gene Length: 2325 bp
Protein Length: 774 aa
Translation table: 11
GC content: 52%
IMG OID: 641280124
Product: hypothetical protein
Protein accession: YP_001545746
Protein GI: 159899499
COG category:
COG ID:
TIGRFAM ID:
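
The start, end, gene length, and protein length fields above are mutually consistent. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length (the gene sequence in this record ends in TGA). The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
start_bp, end_bp = 3760289, 3762613
length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2325 774
```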


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGAGCC GACGTGCTGG AATGTTAATT ATTGCCATTC TCAGTTTGGC GCTGCTGGTT 
GGCTTGATTG GCTATGCCTT GTTGGTGCAG CCAGTTGTCG CCCAACTTTC ACCAACGCCA
CCAACCCTCT TTACCCAGTT GCAATACCCT TTCAATCAGA GTTTGATTCC AGTGGGCAAA
GTGCTGACGG TGCATTCGCG CTCGTGGGGC AGCCAGCCGA TTGAGAACGT CGAATTATGG
GCTGATGGTC AGCCATGGGC GCTGCAAGCA GCCAATAATG CAACGCTATA CGAAGGCTTA
TTTGCTTGGC AATCTCTTGG CTTGGGCGAC CACAGTTTGG TTGCTCGCAC CAACGATACC
AGCAAAATTC CCTCAACTTC GGCGATTGTG CGCTTGAATG CTGTGATGCC ATATGAGCCT
GTTGCGACCT TCCAAGTGCA ACAGTCAGCA GGTGAAAGCC TGGAAAGCCT GAGCAAAGCC
TTTGACGTTG CGCCCCAAAC TCTGCTGAAG CTCAACCCAC AGCTTGGCAA TTTGCCCTTG
AACCAGCCCT TGGGCAGCGA TCAATCGATT ACGATTCAGT TGGCTGGTCA ATTTACCCCC
ATCAGCACTA CCACCAGCCT GAGCGCGACT CAAACGCCAC TTCCGGCTGA TGTTGCCAAT
TTGCCGTTGG CTACACCCTA CGACCCCAAT GCAATCTGGT TTGATTTGCA GCGCCGTTTT
GGCACAGCCA ATGTGCCGTT AGCTCCTGAG GCGCTGGTTG GCAGCAGCAA TTGCCAAACC
CAGTTGGTGT TTACGCCCAC ATCAGATGAT GCTGATGGCT TTTTTATCTA TCGGGCTGGG
CCAACGCAAA ACCATTTTGA GTTGGTGGCA ACCGTTGCCG CCAATGGTTC TGGCCCACAG
CTTTGGCAAG AACCAGCCGA TTTTGGCCAA ACGCTCTATT ATGTGGTGGC GTTTAATCCA
GCTGGGGCAG CGGCTAGCCC AGTGGTGGCA CTAGAGAATG CTGATTCGGC TTGTGCCACG
TTGCAAGTGC CTCAATTTGC CAGTTTGCTG CTCACGCCAA AAATAGCCGT CAGCGATGTT
TATTGTTATA GCTCGTTAGC AAACGGCCCA TGGCAACGAG TGCCCAATCA AGGCTTTTTG
CTGCCAATTG CTGGCGGCTA CGATTTAGCC AGCGCTTTAC CACTAACCGC GATCAGCGCC
AGCCAAACTT GGAATTTGGA ATGCTGGGGC TGGAATGGCA CGCAGCCACA ATTATTGGGC
AACACCAGCA CCACATTAAG TCTTGGGCAG GAGCAAATAA TTGCCCTCGA TGCCGATTTA
TTCAGTTTGC AAGGCCAATT GAACACTAGC ACCCAAATTC AGCAAGTGCC TACGCTCAGC
ACAATTGCGC CGCCAACCAA CCTTCGCTTA ACCACCGATC TTGAGGAGTG TGTGCAAGCA
GCGCCGCAAG CTGATGATTT TTGGCGGACG GCTTGTAGCA CCAATCTGGC GGCGGGAGCG
ACCGTTCTAA CCTGGCAATG GAGCGAACAG GCCTGTTTTC CAAGCGCTGA CGGCCAAGAT
TGTAGTGCCA ACGCCAATCT TGAAGGCTTT CAAATTAACG ATCGCTTGGC TGGAACGCCG
CTCGAATTAA CCCGCGTTAA TCCTGAACAA CGCTTGGTGT TTTTGGCTCC ACGTACCATG
CCCAGCCCAA CCGATGAATG TTTGAGTGTG CAAGCTTTTC GTGGTTTGGC CGTTTCGCTG
GATAGTGAAG TGCTCTGTTT GCCAGCGCTT AAATTGGCAG CAGGTAGCTA TACCCTCGCC
CCAAGTTTGT TTAATCTGAA TGCAGGCGTG GCTCAGCAAA CGGTTGGTGA TGGCTGCCCA
GCCTTGCCGA TCAACCAAAG CCAATCAACC AGCCTGAATT ATCCCTATGT CAGTAGTTTG
TTGCTACTTC AGCGCAATGC TGTGGCCGAA ACTGCCTGTC GGCGCTACAC GGCCAGCTGG
TTTGATGGCA GTGTGAGTTT TATTTTGCCG CGACTCGATC AATCAGTTGG TGGCTTAGAA
TTAAGTTTTA GTGTCGCAAG TCAACCTGAG CCAGCCAACG AAGCCTGCCC CGCCAGCGCC
AGCCTGCAAA TCGGGAGCGC TAGCCAAACA ATTGATTTAG CAACTTGGCC CACGAATGGC
TTGGTTACAG TTGCTGTTGA TCAAGCCTTG CTCGAACAAA TTGGGCAATC GAATGACCCC
AAATTGAACT TTACACTTCA AGCCGAGCGC CAACTTGCCA ATCGCAACCA ACGTTGCAGT
AGCGAGCTAG GCGATTTTTC ATTGAAACTA ACGGTGCAAC CATGA
 
Protein sequence
MMSRRAGMLI IAILSLALLV GLIGYALLVQ PVVAQLSPTP PTLFTQLQYP FNQSLIPVGK 
VLTVHSRSWG SQPIENVELW ADGQPWALQA ANNATLYEGL FAWQSLGLGD HSLVARTNDT
SKIPSTSAIV RLNAVMPYEP VATFQVQQSA GESLESLSKA FDVAPQTLLK LNPQLGNLPL
NQPLGSDQSI TIQLAGQFTP ISTTTSLSAT QTPLPADVAN LPLATPYDPN AIWFDLQRRF
GTANVPLAPE ALVGSSNCQT QLVFTPTSDD ADGFFIYRAG PTQNHFELVA TVAANGSGPQ
LWQEPADFGQ TLYYVVAFNP AGAAASPVVA LENADSACAT LQVPQFASLL LTPKIAVSDV
YCYSSLANGP WQRVPNQGFL LPIAGGYDLA SALPLTAISA SQTWNLECWG WNGTQPQLLG
NTSTTLSLGQ EQIIALDADL FSLQGQLNTS TQIQQVPTLS TIAPPTNLRL TTDLEECVQA
APQADDFWRT ACSTNLAAGA TVLTWQWSEQ ACFPSADGQD CSANANLEGF QINDRLAGTP
LELTRVNPEQ RLVFLAPRTM PSPTDECLSV QAFRGLAVSL DSEVLCLPAL KLAAGSYTLA
PSLFNLNAGV AQQTVGDGCP ALPINQSQST SLNYPYVSSL LLLQRNAVAE TACRRYTASW
FDGSVSFILP RLDQSVGGLE LSFSVASQPE PANEACPASA SLQIGSASQT IDLATWPTNG
LVTVAVDQAL LEQIGQSNDP KLNFTLQAER QLANRNQRCS SELGDFSLKL TVQP