Gene Caul_3863 details

Gene Information

Locus tag: Caul_3863
Symbol:
ID: 5901325
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4179625
End bp: 4181058
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 67%
IMG OID: 641564385
Product: major facilitator transporter
Protein accession: YP_001685487
Protein GI: 167647824
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2223] Nitrate/nitrite transporter
TIGRFAM ID: [TIGR00886] nitrite extrusion protein (nitrite facilitator)
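
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the stop codon; the function names gene_length and protein_length are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS.

    Assumes the CDS length includes one terminal stop codon,
    which does not contribute an amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Caul_3863.
length_bp = gene_length(4179625, 4181058)
print(length_bp)                  # 1434
print(protein_length(length_bp))  # 477
```

Both results agree with the Gene Length (1434 bp) and Protein Length (477 aa) fields in the record.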


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGTAGGC CGCAATCCAT GTCGACTTCC AAATCTGGTC CGGTGCTCAC CGACTGGCGT 
CCCGAGGACG CCACGTTCTG GAGCGAGACC GGCAAGCGCG TCGCCACCCG CAACCTGTGG
ATCTCGGTGC CCAACCTGCT GCTGGCCTTC TCGGTCTGGA TGGTGTGGTC GATGGTCATC
GCCAAGCTCA AGCTGGTGGG CTTCAAGTTC ACCACCGAGC AGCTGTTCTG GCTGGCCGCC
CTGCCCGCCC TGTCGGGCGC CACCCTGCGG GTGTTCTACA GCTTCATGGT GCCGATCGTC
GGCGGCCGGC GCTGGACCGC CCTGGCCACC CTGTCGCTGC TGATCCCGGC CATCGGCATC
GGAATCGTCG TGCAGCACCC GGAGACGCCC TACGGCGTCT TCGTTCTCCT GGCCCTGGCT
TGCGGCCTGG GCGGCGGCAA CTTCGCCTCC TCCATGGCCA ATATCTCGTT CTTCTTCCCG
AGGAGCGAAA AGGGCGCGGC GCTCGGCGTC AACGCCGGCT TCGGCAATCT CGGCGTCAGC
GTCATGCAGT TCCTGGTGCC GGTGGTGATC GGTCTGGCGC TGTTTGGCCC CCTGGTTGGC
GCGCCCCAGA CCATCGTCGA CCATGGCGTG ACCAAGCAGA TCTGGCTGCA GAACGCCGCC
TATGTCTGGG TCCCGTTCAT TGTCGTCGCC GCCGTGGCGG CCTGGTTCGG CATGAATGAC
CTGACCTCGG CCAAGGCCTC GTTCGCGGCC CAGTCGGTGA TCTTCCGCCG CAAGCACAAC
TGGATCATGT GCTGGCTCTA TCTGGGCACC TTCGGCTCGT TCCTGGGCTT TTCGGCCGCC
TTCCCGCTGC TGACCAAGAC CCTGTTTCCC GACATCAACG TGCTGCAGTT GGCCTTCCTC
GGTCCGCTGA TCGGCGCCGC CTCGCGGGCC CTGACCGGCG GCGTCTCCGA CCGGTTCGGC
GGCGAGCGCG TCACCCACTG GGCCTTCCTG GCCCTGGCCG CCGGCGTGCT CGGCGTGCTG
GCCGGGGTCG GGGCCTATGG CGCCGCGCCG TCCTTCCCGA TCTTCTTCGC CAGCTTCCTG
TGGCTGTTCT TTTGGACCGG GGTCGGCAAC GCCTCGACCT TCCAGATGAT CCCGGCCATC
GTCCGCGCCG ACATGCCTCG CCTGATGCCC CAGGCCACTG TCGAGACCCG TCAGCGGGCC
GCGGAAATGG AGTCGGCGGC CATCGTCGGC TTCACCTCGG CCGTCGGCGC CTTCGGCGGC
TTCTTCATCC CCAAGGCGTT CGGCGACTCG CTGAAGGCCA CGGGTGATCC CCAGTTCGCC
CTCTACCTGT TCCTCGGCTT CTACGTCAGC TGCGTGGTCG TCAACTGGGC CGTCTACGGC
CGCAAGGCCT CGCTGCTGCA CGCGCCCGCC CTCATCAAGG TTCCCGCCCA ATGA
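
The gene sequence is printed in groups of 10 bases, 60 per line, so it needs to be stripped of whitespace before any computation. The sketch below shows one way to do that and to compute a GC percentage like the one in the GC content field. The helpers clean_sequence and gc_percent are hypothetical names, and only the first line of the sequence is used here for brevity, so the printed value describes that line rather than the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, as displayed in the record.
first_line = "ATGAGTAGGC CGCAATCCAT GTCGACTTCC AAATCTGGTC CGGTGCTCAC CGACTGGCGT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full 24-line block to clean_sequence in the same way would give the value for the whole gene.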
 
Protein sequence
MSRPQSMSTS KSGPVLTDWR PEDATFWSET GKRVATRNLW ISVPNLLLAF SVWMVWSMVI 
AKLKLVGFKF TTEQLFWLAA LPALSGATLR VFYSFMVPIV GGRRWTALAT LSLLIPAIGI
GIVVQHPETP YGVFVLLALA CGLGGGNFAS SMANISFFFP RSEKGAALGV NAGFGNLGVS
VMQFLVPVVI GLALFGPLVG APQTIVDHGV TKQIWLQNAA YVWVPFIVVA AVAAWFGMND
LTSAKASFAA QSVIFRRKHN WIMCWLYLGT FGSFLGFSAA FPLLTKTLFP DINVLQLAFL
GPLIGAASRA LTGGVSDRFG GERVTHWAFL ALAAGVLGVL AGVGAYGAAP SFPIFFASFL
WLFFWTGVGN ASTFQMIPAI VRADMPRLMP QATVETRQRA AEMESAAIVG FTSAVGAFGG
FFIPKAFGDS LKATGDPQFA LYLFLGFYVS CVVVNWAVYG RKASLLHAPA LIKVPAQ
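
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the database's own method: it builds the standard codon table in plain Python (translation table 11 uses the same codon-to-amino-acid assignments and differs in which codons may act as start codons), and the translate helper is a hypothetical name. It assumes the input starts in frame and stops at the first stop codon.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(block: str) -> str:
    """Translate an in-frame nucleotide block, stopping at the first stop codon."""
    seq = "".join(block.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 24 bases of the gene sequence from the record.
print(translate("ATGAGTAGGC CGCAATCCAT GTCGACTTCC"))  # MSRPQSMSTS
print(translate("CTCATCAAGG TTCCCGCCCA ATGA"))        # LIKVPAQ
```

The two outputs match the first ten and last seven residues of the protein sequence listed above.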