Gene Haur_3698 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3698 
Symbol 
ID5735547 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4651088 
End bp4652641 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content51% 
IMG OID641280850 
Producthypothetical protein 
Protein accessionYP_001546462 
Protein GI159900215 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.163977 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTTCTT TGAAATATAT AAGTCTGGCG TTATTACTCG TGGCCTGCGG TGGTCAAAGC 
CCTGCAACGA CCACGCCAGC AGCAACCACC AATTTACCAG CCCAAGCCAC CGCCACCATC
CAAACTGGCT CAGCAGCAAC TACTATTCCA ACAACCGTGC CCCAAGCAAC GGGTATGCCG
CAAGCAACGG TTGCTACGGC CAACCAAATT AATTTAAGTG CGCCGCCTGT GGATGCGGTG
TTGACCGATT CGGTGCGCGT GGCTGGCACG ATCGTGCTTA CCCCATTCGA AAAAACCTTG
CGCCTCGTGA TTCAAACCAA CGACGGCAAC ATTCTGTATG AGGGGCCAAT TAATACCACT
GGCGAATATG GCAGCAGCGC AACCTTCGAT GTGACCGTGC CAATCGTCGC AGCAGCAAGC
GGCCCAGGTG TGATCAAGGT GATCGAAGAT GATATGAGCG GTGAATTGCC CTATCGCACA
ATTGCCGAGC AACCAGTCCA ATTTACTTCG ACTTCTGCCG AACCGACGCC TGCCGAGCCA
GGAATTTTGA TCGAATTGAC TGAACCAGCG ATGCATGCGG TCGTTGGCAA TCCATTGAAT
TTCAAAGGCA CGCTCTCGGC AATGCCCTTT GAAAAAAATG TCGTGATCGA AGTCTATGAT
AGTGAATTGC ATTTGCTCGG TCAAACCAGC GTGATTGCCG ATGGCGAATA TGGCTCGGCT
GGAACATTCA GCGGCAGTAT CAATTTTCAA GCCCCCTTGA GTAGCCGCAT CGGTCGGATT
GTGGCCTATA CAACCTCGCC CAAGGATGGT TCAGTCGTTG GGCGTGACGA AGCAACTCTG
ACGTTGCCTG CTTGGAATGG CACAGGCGCA TATTTGGCCC AACCTGCGCC CGAAACCAGT
GCCTTCTTGC CTTTGCATGT TGAAGCCGTT GGTTTAAGCA GCGATACTTA CACTGTGCGT
TTGCGCTACG CCGATGGCAC CCTGTTGGAA AACACTACCC AAGCCTACAA CGGGTATTTG
GCCTTGAGTT TGATGTGGGA CAATGCTGCA CCCATTTTAC CCAACCAAAG CGCAATTTTA
GAGTTGGTCA AGGCTGATGG CACAGTTGAG TTGACCCAAA ATCTGTATAT GCAAGATCTA
ACCAGCCAAC CGACTACGAG TGTCGAAGTC TCTTGGTTAG CTGGCGAAGG CTCGATCAAT
GGGATTCGGA TTTTACCAAA AACCTCAAGT GTCGCCAGCG CTGCCTTACG CGAGTTAGTT
TGGGGTCCAG TTGGCAAAGA TTCAGCCTAT AGCACAGCGA TTCCTAGCCC TAAAATTATT
GCCGATTACA CTGGTGATAA AACTGGCTGG ACTGGGCGGG TGCATCTGCG CTCAGTGCGG
ATCGAAGGCG ATATCGCCTA CGTCGATTGG AGTCGCGAAA TGCGAGCATG GGGCGGTGGG
TCAATGCAAC TTGAATCACT GCAAGCTCAA GTTGACCTAA CGCTCAAGCA ATTTTCTCAA
GTCAAGCAGG TTGTTATGAC GGTTGAAGGC AGTGAAGAAG TGCTCCAACC ATAA
 
Protein sequence
MRSLKYISLA LLLVACGGQS PATTTPAATT NLPAQATATI QTGSAATTIP TTVPQATGMP 
QATVATANQI NLSAPPVDAV LTDSVRVAGT IVLTPFEKTL RLVIQTNDGN ILYEGPINTT
GEYGSSATFD VTVPIVAAAS GPGVIKVIED DMSGELPYRT IAEQPVQFTS TSAEPTPAEP
GILIELTEPA MHAVVGNPLN FKGTLSAMPF EKNVVIEVYD SELHLLGQTS VIADGEYGSA
GTFSGSINFQ APLSSRIGRI VAYTTSPKDG SVVGRDEATL TLPAWNGTGA YLAQPAPETS
AFLPLHVEAV GLSSDTYTVR LRYADGTLLE NTTQAYNGYL ALSLMWDNAA PILPNQSAIL
ELVKADGTVE LTQNLYMQDL TSQPTTSVEV SWLAGEGSIN GIRILPKTSS VASAALRELV
WGPVGKDSAY STAIPSPKII ADYTGDKTGW TGRVHLRSVR IEGDIAYVDW SREMRAWGGG
SMQLESLQAQ VDLTLKQFSQ VKQVVMTVEG SEEVLQP