Gene Jann_3042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_3042 
Symbol 
ID3935513 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp3068903 
End bp3070576 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content61% 
IMG OID637905413 
Productnitrate transport ATP-binding subunits C and D 
Protein accessionYP_510984 
Protein GI89055533 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1116] ABC-type nitrate/sulfonate/bicarbonate transport system, ATPase component 
TIGRFAM ID[TIGR01184] nitrate transport ATP-binding subunits C and D 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.499138 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.908503 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATGTC TTGAGCTGAA AAATGTGTCC AAAAGCTATG GCGAAACGCC CGTGCTGAGC 
GACATTAACC TTGAGATCAA GGAGGGGGAA TTTCTGGTCC TTCTGGGATT TTCGGGCACC
GGAAAGACGA CGCTGATCAA CCTGATGGCG GGCCTGGAGG CACCGTCGAA GGGAGAGGTG
ACGTTCAAGG GCGCCCCGGT GGTTGAACCG GGCCCGGAGC GCGGCGTGAT TTTCCAGAAC
TATTCGTTGA TGCCGTGGTT AACGGTGGCG GGGAATGTGG GGCTGGCCGT GGACACGATG
TTTGGCGATC TGCCCAGGGC AGAGCGCGCC AAGAGGGTGG ACCGCTACGT GGATATGGTG
GGGCTGACGC CTGCTGCCAC GCGCCGCCCG GCGGAATTGT CCGGCGGGAT GCGGCAGAGG
GTGAACGTGG CACGGGCCTT AGCGATGGAC CCCGAAATGC TGCTGCTGGA TGAGCCGTTG
AGCGCGCTGG ATGCGCTGAC AAGGGCGAAT TTGGCGGAAG AGATTGAGAG GATCTGGGAG
GCCAGCAAAA AGACCTGCGT GCTGATCACA AACGATGTGG ATGAGGCGAT CTTGCTGGCG
GATCGGATCA TTCCGATGAA CCCCGACGGC ACTTTGACGG ACGCGTTCGA GGTGGGCATT
GCACGCCCCA GGGACCGGGT CGCGATGAAC ACAGACGCCG AGTTCATTCG CCTGCGCGCT
GAGGTCACGA AGTACCTGAT GGATGTGGGG ATCGAGGCGA AGGTGGAAGG CACGCGCGTG
CTGCCCGAGG TGACGCCGAT CCACGGCGTG CCTTTGGCGG TAGCGAATGC CGCGCAAACG
GCGCTGGAAG AGAGGTATCT GGAATTCTCC AAGGTCCATA AGGTTTACCC GACGCCGAAG
GGCCCGCTGA CAGTGGTCGA AGATTTCGAC CTGAAGCTGC GCCGGGGGGA ATTTATCTCG
CTGATCGGGC ATTCGGGCTG CGGTAAATCC ACGGCGCTGA CGATGGTGGC GGGGCTCAAC
CCGATCTCCA AGGGCGCGAT CAAGCTGGAC GGACGCGCTG TTGAGGGCGC GGATCCGGAG
CGGGCGGTGG TGTTTCAGTC CCCGTCGTTA TTCCCATGGC TATCTGCCCG CGAAAACTGC
GCGATTGGGG TGGATAAGGT CTACCCCAAA GCGTCGCGGG CGGAGCGGCA GGATGTGGTG
GATTACTACC TTGAACGGGT GGGTCTTGCC GACGCGATGG ACAAGCGTGC GGCCGACCTG
TCCAACGGCA TGAAACAGCG CGTGGGCATT GCGCGGGCCT TTGCCCTTTC CCCCAAATTG
CTGCTGCTCG ATGAGCCGTT TGGCATGCTC GACAGCCTCA CCCGGTGGGA GCTGCAAGAG
GTCCTGATGG AGGTCTGGTC GCGCACCAAA GTCACCGCGA TTTGCGTCAC CCATGATGTG
GATGAGGCCA TTCTTTTGGC CGACCGTGTT GTCATGATGA CCAACGGGCC GCAGGCGACC
ATCGGCAAGA TCACGGATGT GAACCTGCCC CGCCCGCGCA CCCGCAAGGC GCTGTTAGAG
CACCCGGATT ACTACAGCTA CCGCCAGGAT GTCCTCGATT TCCTTGAGGA ATACGAGCAT
GGCGCGAAAC CCAGACCAAA AGCCGCAGCG CCCAAAGCTG TCGCGGCGGA GTGA
 
Protein sequence
MACLELKNVS KSYGETPVLS DINLEIKEGE FLVLLGFSGT GKTTLINLMA GLEAPSKGEV 
TFKGAPVVEP GPERGVIFQN YSLMPWLTVA GNVGLAVDTM FGDLPRAERA KRVDRYVDMV
GLTPAATRRP AELSGGMRQR VNVARALAMD PEMLLLDEPL SALDALTRAN LAEEIERIWE
ASKKTCVLIT NDVDEAILLA DRIIPMNPDG TLTDAFEVGI ARPRDRVAMN TDAEFIRLRA
EVTKYLMDVG IEAKVEGTRV LPEVTPIHGV PLAVANAAQT ALEERYLEFS KVHKVYPTPK
GPLTVVEDFD LKLRRGEFIS LIGHSGCGKS TALTMVAGLN PISKGAIKLD GRAVEGADPE
RAVVFQSPSL FPWLSARENC AIGVDKVYPK ASRAERQDVV DYYLERVGLA DAMDKRAADL
SNGMKQRVGI ARAFALSPKL LLLDEPFGML DSLTRWELQE VLMEVWSRTK VTAICVTHDV
DEAILLADRV VMMTNGPQAT IGKITDVNLP RPRTRKALLE HPDYYSYRQD VLDFLEEYEH
GAKPRPKAAA PKAVAAE