Gene Jann_3040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_3040 
Symbol 
ID3935511 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp3066280 
End bp3067638 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content62% 
IMG OID637905411 
Productputative nitrate transport protein 
Protein accessionYP_510982 
Protein GI89055531 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0757013 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.570459 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCGGA TTATCGCAGC CCTCTTTACC ACCACAGCGC TCGCAGGCCC GCTTGCAGCA 
CAAGACCTCG AAATCGACGA GCTGACCTTC GGCTTCATCA AACTCACCGA TATGGCGCCG
CTCGCGATTG CCTATGAGAT GGGCTTTTTC GAGGACGAAG GCCTCTTCGT TACGCTTGAG
GCGCAAGCCA ATTGGCGGGT CCTTTTGGAC GGGGTGATCG ACGGTACGCT GGACGGCGCG
CACATGCTCG CGGGTCAACC GATTGCGGCT ACAATCGGCT ACGGCACGCA GGCCAATATC
ATCACCCCAT TCTCCATGGA CCTCAACGGC AACGGCATCA CGGTCTCCAA CGAGGTCTGG
GATTTGATGC GCCCGCACAT CCCCTCTATG GACGATGGCC GCCCGGTTCA TCCAATCAGC
GCCTCGGCCC TCGCGCCCGT CGTCGAGCAA TACCGCCAAG AGGGCACGCG CTTTGACATG
GGCATGGTCT TCCCCGTCTC CACCCATAAT TACGAGATCC GCTTTTGGCT GGCCGCCGGT
GGCCTGCACC CAGGTTTTTA TAGCCCTGAC GACATCACCG GCACGATTGA CGCGGATGTT
TTCCTGTCTG TCACGCCCCC TCCGCAGATG CCTGCGACGC TGGAAGCGGG GACTATTTTT
GGCTACGCGG TGGGGGAGCC CTGGAACCAG CAGGCCGTCC AACGGGGCAT CGGTGTGCCG
GTGATCACCG ATTATCAATT GTGGCCCAAC AACCCCGAAA AGGTCTTTGG GATTACCGAA
GACTTCGCCG AACAGAACCC CAACACCACG CAGGCCATCG TCCGCGCGCT GATCCGGGCC
GGCATGTGGC TGGACGAGAA TGACAACGCC AACCGTGCGG AAGCCGTGTC GATCCTGTCC
TACCCGGAAT ACGTGGGCGC AGACGAAGAC GTCATCGCGG CGTCCATGAC TGGTACGTTC
GAATTCGAGC CCGGCGACGT GCGCGACATC CCCGATTTCA ACGTCTTCTT CCGCTACTAC
GCGACCTATC CCTACTATTC GGATGCGGTC TGGTACCTGA CGCAAATGCG CCGCTGGGGC
CAAATCCCCG AGGCCATGTC CGATGAGTGG TACCACGAGG TTGCGGCGCA AGTGTACCGC
CCCGACATCT ATCTGGAGGC CGCGCGCAGC CTGGTCGATG ACGGCTTGGC GGCGGAGGCC
GACTTCCCCT GGGACACCGA CGGCTTCCGT GACGTTGAGA CCGAGATGAT GGGCGGCGTG
CCCTACGACG GGCGCACGCC CAACGCCTAT ATCGACGCGC TGGAGATCGG TCTGACCGGC
GACGAAGTGG TCGTTGACGG GGCCGTCACG GGCGGCTGA
 
Protein sequence
MNRIIAALFT TTALAGPLAA QDLEIDELTF GFIKLTDMAP LAIAYEMGFF EDEGLFVTLE 
AQANWRVLLD GVIDGTLDGA HMLAGQPIAA TIGYGTQANI ITPFSMDLNG NGITVSNEVW
DLMRPHIPSM DDGRPVHPIS ASALAPVVEQ YRQEGTRFDM GMVFPVSTHN YEIRFWLAAG
GLHPGFYSPD DITGTIDADV FLSVTPPPQM PATLEAGTIF GYAVGEPWNQ QAVQRGIGVP
VITDYQLWPN NPEKVFGITE DFAEQNPNTT QAIVRALIRA GMWLDENDNA NRAEAVSILS
YPEYVGADED VIAASMTGTF EFEPGDVRDI PDFNVFFRYY ATYPYYSDAV WYLTQMRRWG
QIPEAMSDEW YHEVAAQVYR PDIYLEAARS LVDDGLAAEA DFPWDTDGFR DVETEMMGGV
PYDGRTPNAY IDALEIGLTG DEVVVDGAVT GG