Gene Jann_3816 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_3816 
Symbol 
ID3936296 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp3905708 
End bp3907276 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content63% 
IMG OID637906194 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_511758 
Protein GI89056307 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.961587 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0948956 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGAT TGTCCAATAT GGGCGGGATG AGCCGCCGAA CGATGCTGCA AGGCACCGCC 
GCCGTCGGCA CCGCCGCGCT GATCACACCC TGGGGGACGC CGCTGCGCGC GCAGCCTGTG
CAGGGCGGAA CATTGCGGGT TGGCATGGCG CACGGCTCAA CCACCGACGC GCTGGACCCG
GGCACCTGGG AAGCAGATTT CATGATCTTT CAGGCCCATA CCCGCAACAA CTACCTGACC
GAGATCGCAG CAGACGGCTC CCTCGTGCCG GAACTGGCCG AAAGCTGGGA AGCGTCTGAG
GATGCCGCGA CCTGGACCTT CACGATCCGC TCGGGCGTTG AATTCCATTC TGGCCACGTG
CTGACGGCGG AAGACGTTGT CGCCTCCATC AACCACCACA GGGGCGAAGA CAGCACCTCT
GCCGCTGCGC CTATCGTGTC GGCCATCACC GATATGACGG TGGATGGCAT GAACGTCGTG
GTCACGCTGG CCTCCGGCAA TGCGGACTTC CCGTTTGTGA TGTCCGACTA TCACCTGCCG
ATCCTGCCCG CCCAGGCCGA TGGCACAATC GACCCCAACA CCCAGGACGG CTGCGGCCCG
TTCCGCATTG TTGAGATCGA ATTCGGCGTT GGCGCCTCCT ACGCGCGCCA TGACGGCTAT
TGGAAAGAGG GTTTGCCCCA TTTCGACGCG ATTGAGCAGA TCGTCATCTT CGACGATGCC
GCCCGCCAGA ACGCGCTGAT TTCTGGAGAG GTCGATTACA TCGACACCGC CGACCTCAAC
ACCGTGCACC TGCTGGAGCG CGCGCCGGGG ATCGAGATCC TGTCCGTGAC CGGCACGCAG
CACTATGGCC TGCCGATGGA CACGCGGGCG GAGCCGTTCA ATGATCCCAA CGTCCGTCTG
GCGCTGAAGT ATTCGATTGA CCGGACCCAG CTGGTGGACA CGATCCTGAA CGGCTACGGC
TCCATCGGCA ACGACCACCC CATCGGCTCC GGCCAGCGGT TCTTCAACAC CGAGTTGGAG
CAGAAGGAAT ACGATCCGGA TCGTGCGCGC TTCCACCTGG CCGAGGCCGG GATGGAAAGC
CTCGACGTGG ACATCCACCT GGCCGACGCC GCCTTCGCGG GTGCGATGGA CGCGGGCGTG
TTGTTCTCGG AAAGTGCTGC GGCGTCCGGC ATCAACCTTA ACGTCGTGCG CGAACCCAAC
GACGGCTACT GGTCCAACGT CTGGATGCAG AAGCCCTTCG CGGGCACCTA TTGGGGCGGT
CGCCCGACCG AAGACCTGAT GTTCGCGACG GCCTACGAGC GCGGCGTGCC ATGGAACGAG
ACGTTCTGGG ACAACGAGCG GTTCAACGAC CTCCTGCTTC AGGCCCGCTC CGAGCTGGAT
GAGGACCTGC GCCGCGACAT GTATTTCGAG ATGCAGCAGA TCGTGTCCGA CGATGGCGGG
ATCATCATCC CGATGTTCGC CAATTACGTC GGCGCCTACA GCGATGCCCT TGCCCATCCC
GAGCAGGTCG CGTCGAACTG GCGCAATGAC GGCCACCGGA TAGGCGAACG CTGGTGGTTC
GCAGCTTAA
 
Protein sequence
MKRLSNMGGM SRRTMLQGTA AVGTAALITP WGTPLRAQPV QGGTLRVGMA HGSTTDALDP 
GTWEADFMIF QAHTRNNYLT EIAADGSLVP ELAESWEASE DAATWTFTIR SGVEFHSGHV
LTAEDVVASI NHHRGEDSTS AAAPIVSAIT DMTVDGMNVV VTLASGNADF PFVMSDYHLP
ILPAQADGTI DPNTQDGCGP FRIVEIEFGV GASYARHDGY WKEGLPHFDA IEQIVIFDDA
ARQNALISGE VDYIDTADLN TVHLLERAPG IEILSVTGTQ HYGLPMDTRA EPFNDPNVRL
ALKYSIDRTQ LVDTILNGYG SIGNDHPIGS GQRFFNTELE QKEYDPDRAR FHLAEAGMES
LDVDIHLADA AFAGAMDAGV LFSESAAASG INLNVVREPN DGYWSNVWMQ KPFAGTYWGG
RPTEDLMFAT AYERGVPWNE TFWDNERFND LLLQARSELD EDLRRDMYFE MQQIVSDDGG
IIIPMFANYV GAYSDALAHP EQVASNWRND GHRIGERWWF AA