Gene Caul_2957 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2957 
Symbol 
ID5900412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3211079 
End bp3212299 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content69% 
IMG OID641563454 
Productmajor facilitator transporter 
Protein accessionYP_001684582 
Protein GI167646919 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2223] Nitrate/nitrite transporter 
TIGRFAM ID[TIGR00886] nitrite extrusion protein (nitrite facilitator) 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.902774 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCTCCC GCGATTTCCT GAAGGCCGGC CACACGCCGA CCCTGTTCGC CGCGTTCCTG 
TATTTCGATC TGAGCTTCAT GGTCTGGGTG ATCCTGGGTC CGCTCGGCGT GGCCATCGCC
AAGGACTTCC ACCTCGATCC CGCCCAGAAG GGCCTGATGG TCGCTGTGCC GGTCCTGGCC
GGGGCGCTGC TGCGCCTGGT CAATGGCGTG TTGGTCGACC GCATCGGTCC CAAGAAGACG
GGGATGATCA GCCAATTGAT CGTGCTGACC GGCCTGGTCC TGGCCTGGTT CCTGGGCATC
CACAACTATC ACCAGGTGCT GGCCCTGGGC CTGGTGCTGG GCGTGGCCGG CGCCTCGTTC
GCCGTGGCCC TGCCGCTGGC CTCGCGCTGG TATCCGCAGG AGCATCAGGG CCTGGCCCTG
GGCATCGCCG GGGCCGGCAA CTCCGGCACG GCCCTGGCCG CCCTGTTCGC GCCGATCCTG
GCCAAGCACT TCGGCTGGCA GAACGTGATC GGCCTGGCCG CCATCCCGCT AGCCGTCGCC
TTCGTGGTCT ATATGCTGCT GGCCAAGGAC GCCCCCGAGC AGCCCGCCCC CAAGAAGCTC
GCCGAATATA TGGACGTGCT GAAGGTGCCG GACGCCTGGT GGCTGATGCT GCTGTACGCG
GTCACCTTCG GCGGCTTCGT GGGCCTGGCC TCGTCGCTGA CCATCTATTT CAACGCCGAA
TACGGCCTGA CCCCGGTGAC CGCCGGCTTC TTCACCGCCG CTTGCGTGTT CGCCGGCTCG
TTCATCCGTC CGGTGGGCGG GGCCCTGGCC GACCGCTTCG GCGGGGTGCG GACCTTGACC
ATCGTCTTCG CCCTGGCCGC CTTGGGCCTC GCCACGGCCA GCTTCCAGAT GCCTTCGGCC
TGGATCGCCC TGGCGGTGCT GATGTTCTCG ATGCTGGCCC TGGGCGCCGG CAACGGCGCG
GTGTTCCAGC TGGCGCCCCA GCGGTTCCGC AAGGAGATCG GCGTCATGAC CGGCCTGATC
GGCATGACCG GCGGCATCGG CGGCTTCTAC CTGGCCTCGA GCCTGGGCAT GGCCAAGAAG
CTGACCGGTT CCTATCAGAT CGGCTTCCTC GGCTTCGCGG CCCTGGCGGT GTTCGCCCTG
GTCGCCCTGC ACAGCCTCAA GGCCCGCTGG CGCGCCGTCT GGCCGACCCT GCACGGCGAC
GCCGCGCCCG TTCGCGTCTG A
 
Protein sequence
MLSRDFLKAG HTPTLFAAFL YFDLSFMVWV ILGPLGVAIA KDFHLDPAQK GLMVAVPVLA 
GALLRLVNGV LVDRIGPKKT GMISQLIVLT GLVLAWFLGI HNYHQVLALG LVLGVAGASF
AVALPLASRW YPQEHQGLAL GIAGAGNSGT ALAALFAPIL AKHFGWQNVI GLAAIPLAVA
FVVYMLLAKD APEQPAPKKL AEYMDVLKVP DAWWLMLLYA VTFGGFVGLA SSLTIYFNAE
YGLTPVTAGF FTAACVFAGS FIRPVGGALA DRFGGVRTLT IVFALAALGL ATASFQMPSA
WIALAVLMFS MLALGAGNGA VFQLAPQRFR KEIGVMTGLI GMTGGIGGFY LASSLGMAKK
LTGSYQIGFL GFAALAVFAL VALHSLKARW RAVWPTLHGD AAPVRV