Gene Jann_1501 details


Gene Information

Locus tag: Jann_1501
Symbol:
ID: 3933948
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Jannaschia sp. CCS1
Kingdom: Bacteria
Replicon accession: NC_007802
Strand:
Start bp: 1471275
End bp: 1472318
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 64%
IMG OID: 637903851
Product: ABC transporter related
Protein accession: YP_509443
Protein GI: 89053992
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4175] ABC-type proline/glycine betaine transport system, ATPase component
TIGRFAM ID: [TIGR03415] choline ABC transporter, ATP-binding protein
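
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon that is not translated."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record above.
START_BP = 1471275
END_BP = 1472318

length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.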


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.183759
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACCG CCGTGTCCTT CCGCGATGTC TGCATCATGT TCGGCCCCCG CCCGGCCCGC 
GCCCTGAGCC TTGCGGATGA AGGCGGCACA AGGTCCGAGA TTCAATCCGC GACGGAGCAT
GTTCTGGGCG TTCACGATTG TTCGCTGGAT GTGGAGGAGG GGGAGATCCT CGTTCTCATG
GGCCTGTCTG GCTCCGGCAA ATCCACGCTT TTGCGCGCGG TTAACGGCCT TAACCCTGTG
GCCCGGGGAC AGGTCGAAGT GCGCGACGGC GATTGGTCCT GCACGCTGCC CGGCGCCTCG
GTCGCGGACC TGCGTTACCT GCGGCAGAAC TGCGTTTCCA TGGTCTTTCA ACAGTTTGGC
CTGCTGCCAT GGCGCACAGT GCGCGAGAAT GTCGGCCTTG GGCTGGAGCT TGCGGGGCAA
TCGGCTACGG CCCGGGCAGA GGCGGTGGAC AAGCAATTGG CGCTCGTGAA CCTCAGCGAA
TGGGGGGATC GCAAGGTGGG CGAATTGTCT GGCGGCATGC AGCAGCGCGT TGGCCTGGCC
CGCGCCTTCG TCACCGACGC CCCGATCCTT CTGATGGATG AGCCGTTCTC CGCCCTTGAT
CCCCTGATCC GTTCCAAACT GCAGGACGAG CTGCTGGACC TCCAGCGCGA CCTGAAACGC
ACCATCATCT TCGTCAGCCA TGACCTGGAT GAGGCGTTCA AGATCGGCAA CCGCATCGCG
ATCCTGGAAG GGGGCCGCAT CGTGCAGATC GGCACGCCCC GGCAGATCTT CTCGGAACCC
GCGACGGGCT ACGTGGCCGA ATTCGTCTCT AACATGAACC CCTTAGGCGT TCTGACCGCA
CGCGACGTCA TGCAAGACGT GCCCACCGAC GCGCCCCGGA TCCCGGTGGA AATGCCCGTC
AAAGACATCC TTGCGCGATT TGCGGACACG CCTGCGCCCC TCGCCGTTGA GGAAGACGGA
GAGGTCATCG GCACCGTAAC GACCGACAGC GTCGCCGCGC GTCTCGGCAC GCCGGAGGCC
GGGCATTCAA CCTCTCCGGC TTAG
 
Protein sequence
MSTAVSFRDV CIMFGPRPAR ALSLADEGGT RSEIQSATEH VLGVHDCSLD VEEGEILVLM 
GLSGSGKSTL LRAVNGLNPV ARGQVEVRDG DWSCTLPGAS VADLRYLRQN CVSMVFQQFG
LLPWRTVREN VGLGLELAGQ SATARAEAVD KQLALVNLSE WGDRKVGELS GGMQQRVGLA
RAFVTDAPIL LMDEPFSALD PLIRSKLQDE LLDLQRDLKR TIIFVSHDLD EAFKIGNRIA
ILEGGRIVQI GTPRQIFSEP ATGYVAEFVS NMNPLGVLTA RDVMQDVPTD APRIPVEMPV
KDILARFADT PAPLAVEEDG EVIGTVTTDS VAARLGTPEA GHSTSPA
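
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is a minimal example, not the method the database used: the codon table is built by hand, and only two excerpts of the gene (its first 60 bases and its last 24 bases) are translated. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in which start codons are allowed; since this gene begins with ATG, the sketch does not handle alternative start codons. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table in the conventional TCAG ordering (amino acid assignments
# shared by the standard code and translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spacing used in the record
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


# Excerpts copied from the gene sequence above (both start in frame).
FIRST_60 = "ATGAGCACCG CCGTGTCCTT CCGCGATGTC TGCATCATGT TCGGCCCCCG CCCGGCCCGC"
LAST_24 = "GGGCATTCAA CCTCTCCGGC TTAG"

print(translate(FIRST_60))
print(translate(LAST_24))
```

The two excerpts correspond to the first 20 and the last 7 residues of the protein sequence listed above; the final TAG codon is the stop and produces no residue.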