Gene Jann_1525 details


Gene Information

Locus tag: Jann_1525
Symbol:
ID: 3933973
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Jannaschia sp. CCS1
Kingdom: Bacteria
Replicon accession: NC_007802
Strand:
Start bp: 1494831
End bp: 1495841
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 60%
IMG OID: 637903876
Product: twin-arginine translocation pathway signal
Protein accession: YP_509467
Protein GI: 89054016
COG category: [R] General function prediction only
COG ID: [COG4221] Short-chain alcohol dehydrogenase of unknown specificity
TIGRFAM ID:
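
The coordinate and length fields above are mutually consistent, which a short calculation can confirm. The sketch below is illustrative only: the helper names are hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
# Hypothetical helpers for checking the record's length fields.
# Assumption: start/end coordinates are 1-based and inclusive.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(1494831, 1495841)
print(length)                     # 1011
print(protein_length_aa(length))  # 336
```

Both results agree with the Gene Length and Protein Length fields of the record.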


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0574821
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGTTC AAAAGACTTC AATCGAAACG CCCCAAAACA ACGACCTGAA CCGCCGCGCC 
GTGTTGGCCG GTGTCGGGAC CCTTGGGATT GCAGCGTCGG CCACCGGCGC AGTTCACGCA
CAGACAGAAA CGGAAGGAAA CGTGATGGAA AATGGAATGT TGGCTGGAAA AGTTGCCCTG
GTGACCGGGG CCGCACGCGG GATCGGGCTG TCGGTCGCCA AGACCTATGG TCGGCATGGC
GCAAAGGTCG TGATGCTGGA TATTGCCGAT CCATCGGCCG TTCCACCCGT CGAGGGCTTC
CGCATCGCCA ATGCCGAAGA ATTCCAGGCA GCCATTGAAG AGGTACGTGC GATTGCCCCT
GATACACTTG GGATCACGGC TGATGTTCGC GACCGTGCCG CGCTTGCAAA TGCGGTCGCC
CAAGCGACCG AGACTTTCGG CGGGTTGGAC ATCGCTGTGG CGAATGCGGG CTATGTCCGC
TGGCATGGCT TTGCCGATGG TACGGAAGAA GACTGGAAGA GCGTCTATGA CGTCAATGTC
CACGGTGTTT TCAACACGTT CTACGCCGCT ATCCCGGCAT TGCGCCAGCG CGGCGGCGGA
TCTCTGATCT CGCTCAGCTC CATCGCTGGT CGGATCGGCG TCATCGGCAA CGGGGCCTAT
AATTCTTCGA AATGGGCCGT CATCGGCATG ACGAAGCAAG CCGCGCTGGA ACTGGGCGCG
GATCATATCC GGGCCAACGC CATTGCGCCG GGTCCCGTGA ACACGCCCAT GTATCGGTCG
GAAGGCCAAA AACGCTCGAT GGGCATCGAC GTGGCCGCTT TGTCGGAGGT TGAGGCAAAT
GCAGCGCAAG ACGCGATGTT GAACCCGGCC TTGCCCTTAG GTGAGACCCC CGCGTCCGAG
CCGCAAGCCA TCGCAAACAC CGCTCTCTAC CTTGCGTCCG ATCTTTCGGC GGACGTCTCA
GGTGCCGTCC TAGACACCGC GCTTGGCTAC AACGCAAACT ACACTGGCTA G
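
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be cleaned before any computation such as GC content. The following is a minimal sketch with hypothetical function names; it assumes GC content is simply the percentage of G and C bases over the sequence length, which may differ in rounding from how the database derives its GC content field. The sample string is the first two blocks of the sequence above.

```python
# Hypothetical helpers; GC content is assumed to be (G + C) / length.

def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence.
sample = clean_sequence("ATGACAGTTC AAAAGACTTC")
print(len(sample))         # 20
print(gc_percent(sample))  # 35.0
```

Passing the full gene sequence through the same two functions gives the value to compare against the record's GC content field.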
 
Protein sequence
MTVQKTSIET PQNNDLNRRA VLAGVGTLGI AASATGAVHA QTETEGNVME NGMLAGKVAL 
VTGAARGIGL SVAKTYGRHG AKVVMLDIAD PSAVPPVEGF RIANAEEFQA AIEEVRAIAP
DTLGITADVR DRAALANAVA QATETFGGLD IAVANAGYVR WHGFADGTEE DWKSVYDVNV
HGVFNTFYAA IPALRQRGGG SLISLSSIAG RIGVIGNGAY NSSKWAVIGM TKQAALELGA
DHIRANAIAP GPVNTPMYRS EGQKRSMGID VAALSEVEAN AAQDAMLNPA LPLGETPASE
PQAIANTALY LASDLSADVS GAVLDTALGY NANYTG
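
The protein sequence corresponds to the gene sequence translated with translation table 11. As a rough illustration, the sketch below builds the codon table from the compact NCBI-style amino acid string and translates the first three blocks of the gene. The names are hypothetical, and it relies on the fact that table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons, which this sketch does not model).

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G with the third position
# varying fastest. Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGACAGTTC AAAAGACTTC AATCGAAACG"))  # MTVQKTSIET
```

The output matches the first block of the protein sequence above. Applied to the full 1011 bp gene sequence, the same function should stop at the terminal TAG codon.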