Gene Jann_1349 details

Gene Information

Locus tag: Jann_1349
Symbol:
ID: 3933796
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Jannaschia sp. CCS1
Kingdom: Bacteria
Replicon accession: NC_007802
Strand:
Start bp: 1313539
End bp: 1314684
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 62%
IMG OID: 637903699
Product: twin-arginine translocation pathway signal
Protein accession: YP_509291
Protein GI: 89053840
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
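
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


length = gene_length(1313539, 1314684)
print(length)                  # 1146
print(protein_length(length))  # 381
```

Under these assumptions the values agree with the record: 1146 bp and 381 aa.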


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.970242
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGATCACG ACACTCAAGA GCTGAAGTCG GCAGAGCGCC GCAATTTCCT GAAGCTGACC 
GGCGCAGGTG CATTCACCGC CGCCATGGTC GCCGGTGGCG CGGGCATGCT GTGGTCGACT
GAGGCCGCAG CCCAGACCCA ACAGGAAGAG GCCGCGCGCG AAGCCGCAGC CGAGCACATT
ATGACGCTGG CCACGGCCTA TGTTCTGGGT GCCTCGCGCA GCTATCCGAT CATGCAACTG
GACCTGAAAG AGAATATCCA GAACATGACC AACGGCAAAG TCTATGTGCG TCTCGCGCCG
GGCGGTCAGT TGGGGGCAGG GGGCGCTTTG GCTCAGGCCG TGCAATCCGG CACGATCCAA
TGCGCGCAGC ACTCGCTGGC CAACTTCGCA CCGTTTGCCT CCGCCGTTGA TCTGATCAAC
CTGCCGTATT TCTGCGGCTC CAACCAGCGC TTCACCAACC TGACCTCGTC GGATGCTTGG
AAAGAAGAAG TGGACCCGCG TATCGGAGAG TCTGGTTTCA AAGCCCTGAT GTATATCGTC
ATCGACCCCC GCGTCGTGGC CGTGCGCCAG GGCGGCGACG CGATCATCAC GCCCGCCGAC
ATGGAAGGCG TCAAGTTCCG CGTTCCCGGA TCTGCAATGT TGCAGCAATA CTACCGTATG
GTCGGCGCCA ACCCGACGCC CGTGGCCTGG GGTGAGACAC CGTCCGCGAT CCGCCAGGGT
GTGGCCGACG CACTCGACCC GTCCGTGGGC GCATTGCACG TCTTCGGCTT TGGTGAAATC
CTCAGCCATG TGACCTTCAC GCAGGCCGTG CCCGACAGCC AAGTCTACTC GGTCAATCTG
GAATGGTTCA ACTCGCTGCC TGCCGACGTG CAGGAAGGCA TTGAGTTTGC AGGCGAAGTC
ACCCAGCAGC AGAACCTGGC CAAAGTGCCC TCGGCGCGGT CCTACGCGAT GTCCCAATTG
TCGGCAAATG GGGTGGAGTT CCACTCCCTG TCGGACGATC AACTGGCCGA GTGGCAGTCC
GTGGGCGGCT ATCAATTGCC CGATTGGGAC GAGTTCAAAG TCGATCTGGC CGGCTCCATG
GAGACCTTCG CCCGTCTTGA GGAAGCCGCA GGCACCGCCA GCCGCTACTA CGTCCACGAC
GCCTAA
 
Protein sequence
MDHDTQELKS AERRNFLKLT GAGAFTAAMV AGGAGMLWST EAAAQTQQEE AAREAAAEHI 
MTLATAYVLG ASRSYPIMQL DLKENIQNMT NGKVYVRLAP GGQLGAGGAL AQAVQSGTIQ
CAQHSLANFA PFASAVDLIN LPYFCGSNQR FTNLTSSDAW KEEVDPRIGE SGFKALMYIV
IDPRVVAVRQ GGDAIITPAD MEGVKFRVPG SAMLQQYYRM VGANPTPVAW GETPSAIRQG
VADALDPSVG ALHVFGFGEI LSHVTFTQAV PDSQVYSVNL EWFNSLPADV QEGIEFAGEV
TQQQNLAKVP SARSYAMSQL SANGVEFHSL SDDQLAEWQS VGGYQLPDWD EFKVDLAGSM
ETFARLEEAA GTASRYYVHD A
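
The gene sequence begins with TTG, while the protein sequence begins with M. This is consistent with translation table 11, where TTG is one of the permitted alternative start codons and an initiating codon is read as methionine. The sketch below is a minimal illustration, not the pipeline that produced this record: the helper name translate_cds is hypothetical, and the codon table and start-codon set are written out from the NCBI definition of table 11 as best understood here.

```python
# Standard amino-acid assignments (shared by NCBI tables 1 and 11),
# in the conventional TCAG ordering of first, second and third base.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Start codons allowed by table 11 (assumption: taken from the NCBI listing).
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at '*'."""
    seq = "".join(dna.split()).upper()  # drop the spaces/newlines of the 10-base layout
    if len(seq) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS_TABLE_11:
            residue = "M"  # initiator codon is translated as methionine
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate_cds("TTGGATCACG ACACTCAAGA GCTGAAGTCG"))  # MDHDTQELKS
```

The output matches the first ten residues of the protein sequence; passing the full gene sequence should reproduce the remaining residues in the same way.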