Gene Caul_3862 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3862 
Symbol 
ID5901324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4178267 
End bp4179628 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content68% 
IMG OID641564384 
Productmajor facilitator transporter 
Protein accessionYP_001685486 
Protein GI167647823 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2223] Nitrate/nitrite transporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGACGG CGACGCCATC GCAACCGGGA GCCGGATCGG CCCTGACGAT GAGCACGATC 
GCCTTCACCG CCTGTTTCGC GGTGTGGACG GTGTTTTCGA TCATCGGCGT CAAGATCAAG
CAGGACCTGG GACTGAGCGA GGCCCAGTTT GGCCTGCTGG TCGGCACCCC GATCCTGACC
GGCTCGCTGG TGCGGGTGTT CCTGGGCGTG TGGACCGACC AATATGGCGG CCGCCTGGTC
AATCTGCTGG TCATGCTGTC GGCGGCGGCG GCCACCTTCC TGCTCTCCTA CGCCCACACC
TATCCGCAGT TCCTGGTGGC GGCGCTCGGC GTTGGCCTGG CCGGAGGCTC GTTCGCGGTC
GGCGTGGCCT ATGTCTCCAA GTTCTTCCCC AAGGAACGCC AGGGCGCGGC GCTGGGCGTG
TTCGGGGCCG GCAATGTCGG CGCCGCCGTG ACCAAGTTCG CCGCTCCCTT CGTGATGCTG
GCCCTCGGCT GGCAGAGCGT GGCCCAGATC TGGGCCGGGG TGCTGGCCGT GCTGGCTCTG
GCCTTCTTCT TCACCACCCG CGACGAGCCG GACCTGCAGG CGCGCCGCCG CACCGGCGCC
AAGCCGCAGA ACACCGCCGC CCAGCTGGCG CCGTTGCGCA AGCTGCAGGT CTGGCGCTTC
GCCCTCTACT ACTTCTTCGT GTTCGGCGGC TTCGTGGCCC TGTCGCTGTG GCTGCCGCAC
TATCTGGTCG CGGTCTATCA TCTCAACATC ATCGCCGCGG GCATGCTGGC GGCCGCCTAT
TCCATTCCCG GCTCGCTGTT TCGCATCGTC GGCGGCTGGC TGTCGGACAA GATCGGCGCG
CGCAAGGTCA TGTACCTGAC CTTCGGCGTC AGCGCGGTCT GCGCCTTCCT GCTGTCCTAT
CCGGCCACCA GCTACGTGGT CGACGGGGTG CGCGGCCCGA TCGCCTTTCG CCTGGCCACC
GGGCTGGTCC CGTTCGTGAT CCTGCTGTTC ACCCTGGGCT TCGCCATGAG CCTGGGCAAG
GCGGCCGTCT TCAAGCACAT CCCGGTCTAC TACCCCGATC ACATCGGTTC GGTCGGCGGC
CTGGTCGGCA TGGTCGGCGG CCTGGGCGGC TTCGTGATGC CGATCGCCTT TGGCGCCCTC
AACGACCTCA CCGGCGTCTG GACCAGCTGC TTCATGCTGC TGTTCGTCCT GGTCGCCGGA
GCCCTGACCT GGATGCACCT GGCCATCGGC CGCATGGAGC GCGCCAACGC GCCCCAGCTG
GCCAACCTGC CGCAGCTTCC TGAAATGGCC AGCCTTGGCA CGGCCGCCCC GACCCACGCC
GCGCCCGCCG GGCGCGCCGC CATCCAACCC GCCAACTCAT GA
 
Protein sequence
MQTATPSQPG AGSALTMSTI AFTACFAVWT VFSIIGVKIK QDLGLSEAQF GLLVGTPILT 
GSLVRVFLGV WTDQYGGRLV NLLVMLSAAA ATFLLSYAHT YPQFLVAALG VGLAGGSFAV
GVAYVSKFFP KERQGAALGV FGAGNVGAAV TKFAAPFVML ALGWQSVAQI WAGVLAVLAL
AFFFTTRDEP DLQARRRTGA KPQNTAAQLA PLRKLQVWRF ALYYFFVFGG FVALSLWLPH
YLVAVYHLNI IAAGMLAAAY SIPGSLFRIV GGWLSDKIGA RKVMYLTFGV SAVCAFLLSY
PATSYVVDGV RGPIAFRLAT GLVPFVILLF TLGFAMSLGK AAVFKHIPVY YPDHIGSVGG
LVGMVGGLGG FVMPIAFGAL NDLTGVWTSC FMLLFVLVAG ALTWMHLAIG RMERANAPQL
ANLPQLPEMA SLGTAAPTHA APAGRAAIQP ANS