Gene Jann_3050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_3050 
Symbol 
ID3935521 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp3081031 
End bp3082650 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content58% 
IMG OID637905421 
Productextracellular solute-binding protein 
Protein accessionYP_510992 
Protein GI89055541 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.423075 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACGTC ACGCCCTACT TGCTGCCACA AGCGCGCTTG TACTCGCCCT GCCCGCCTAT 
GCGCAGGAAA CGCACCCTGA AACTGGTGAA GCGCTGGCTG CCAATCAGGA TTTTTCCTAT
CGCCTGCTGG ATCAGTTTCC ATCGCTCGAC CCGCAGCTGA TTGAGGAAAC CGCGGGCGGA
CACGTCGGGC GGCAGTTGTT CGAAGGGCTC TTGACACAAA ACGCGGACGG GTCCCTGCGC
CCGGGTGTGG CAACAGAATG GTCAAGCGAC GACAACCAGA CATGGACTTT TACGCTGCGC
GACGATGCGC GCTGGTCAAA CGGCGATCCG GTCACAGCCA ATGACTTCGT TTATGCTTGG
CGTCGCGCCG CTGATCCGGT GACCGCATCC GAGTACTCTT GGTACGTCGA GCTGACACAG
ATGACCAACG CGGCCGAAAT CATCGCCGGT GAAATGCCGA CGGAGGAATT GGGCGTGCGC
GCCATCGACG ATCACACGCT GGAAGTCACG CTGAATGCGC CCCTGCCCTA CTTCCCGCAG
ATGGCCGTGC ACTATACCTT GATGCCGACG CACCAGCCCA CGATTGAAGC CCATGGATCT
GACTGGACCC AACCTGAGAA TATCGTCAGC AATGGCGCTT ACATCCTGAC CGAGATCGCA
ATCAACGAGT ATTTCCGGCT GGAACGTAAC CCCGAATATT GGGGTGCCGA TGACGTGATC
ATTGACAGCG TTACGGGCTA TGTCATCAAC GACGCCAATC AGGCGCTCAG CCGCTTTCAG
GCCGGTGAGT TTGACATGAT GGACGACCTT CCGGCGGGAT CATATCCCGA TCTGGAAGCC
GAGATGCCAG ATACGGTCCA TGCGACGCCG CGCCTGTGCA CCTATTACTA CCTGATCAAC
CAGTCCGAAA GCGGGGCCGA GGCATTGCAG GACGTTCGCG TGCGCACGGC TTTGAGCTAT
GCCATCCGGC GTGAGGTGAT CACCGACCAG ATCCTTCAAG CGGGTCAGCG CCCCGCCTAC
AGCTTCACCC ATTGGGCAAC AGCCGATTTC GAAATGCCCG ACATTGCCTA CGCCAACATG
ACCCAGGACG CACGCATGGA AGAAGCCATG CGCCTGATGA CCGAGGCTGG TTACGGGCCG
GACAACCCAC TGGAACTGGA CCTTATCTAC AACACGTCAG AGAACCACCG GCAGATCGCA
ATCGCCGCCT CACAGATGTG GGCACCGCTG GGGGTTGAGA TCTCGCTGTC GAACTACGAA
TGGCAGAGCT ACCTTGATGT CCGCGGCAAC CAAAACTTCG ATCTCGGCCG CGCGGCATGG
TGTGGCGACT ACAACGAGGC GTCGACATTT CTTGATCTGC TGACATCGAA CAATGACAAT
AACGATGGCA AGTTCGTGAA CGCCGACTAT GACGCCCTGA TGGCAGAAGC GGCGGTGACG
GCCGATCCGA GCGCTCTCTA CGAGCAAGCC GAACAGATCC TTGCTGACCA AATGGCACTG
ATCCCGATCT ACCACTATTC CCAGAACTTC GTGCTGGACC CAACCATCCA CAACTGGCCG
ATGGAGAACG TGGAAAACAA TTGGTACGTG CGCGATCTCT ACCGCGTCGC GTCCGAGTGA
 
Protein sequence
MTRHALLAAT SALVLALPAY AQETHPETGE ALAANQDFSY RLLDQFPSLD PQLIEETAGG 
HVGRQLFEGL LTQNADGSLR PGVATEWSSD DNQTWTFTLR DDARWSNGDP VTANDFVYAW
RRAADPVTAS EYSWYVELTQ MTNAAEIIAG EMPTEELGVR AIDDHTLEVT LNAPLPYFPQ
MAVHYTLMPT HQPTIEAHGS DWTQPENIVS NGAYILTEIA INEYFRLERN PEYWGADDVI
IDSVTGYVIN DANQALSRFQ AGEFDMMDDL PAGSYPDLEA EMPDTVHATP RLCTYYYLIN
QSESGAEALQ DVRVRTALSY AIRREVITDQ ILQAGQRPAY SFTHWATADF EMPDIAYANM
TQDARMEEAM RLMTEAGYGP DNPLELDLIY NTSENHRQIA IAASQMWAPL GVEISLSNYE
WQSYLDVRGN QNFDLGRAAW CGDYNEASTF LDLLTSNNDN NDGKFVNADY DALMAEAAVT
ADPSALYEQA EQILADQMAL IPIYHYSQNF VLDPTIHNWP MENVENNWYV RDLYRVASE