Gene Csal_1133 details

Gene Information

Locus tag: Csal_1133
Symbol:
ID: 4029159
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 1291995
End bp: 1293215
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 66%
IMG OID: 637966310
Product: nitrate transporter, putative
Protein accession: YP_573188
Protein GI: 92113260
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:
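
The coordinates and lengths listed in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive and that the gene length counts the stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of amino acids, assuming the final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1291995, 1293215)
print(length)                     # 1221
print(protein_length_aa(length))  # 406
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.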


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0226657
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGATC GCCAGACGCC TCGAGATGCG TGGCCTTCGC CGGAACTCGA CACGCTGACG 
CTGGGCATGC TGCCGCTCAA CGATGCCGCG CCGTTGGTGG TAGCGCGTGA GCGCGGCTTC
TTCGCCGAGC AGGGGCTGGA CGTCACGTTG AGCGTGGAGT CGTCGTGGGC GGGGATACGC
GATGCCATGC AGTTGGCACT GCTCGATGGC GCACAGATGC TGCCATTGAT GCCACTGGCC
TCGACGCTTG GACTGGACGG TCGACGAACG CCGATGCTCT CGGCGCTGAC CCTCAGTCTC
GGCGGCAATG CCATCACCGT GTCCAACGCC TTGTACGACG ACATGCTGGC TGCCGATCCC
GACGCCATGG CGGCCTCACC GACCTCCGCC CGAGCGCTGG CGACGGTGGT GAGACAGCGG
CGGGAGAAGG GGGCGCCACC GCTGCGTTTC GCCAGCGTGT ATCCCTTTTC CTCGCATCGC
TATTTATTGC GCTACTGGCT GGCGGAAGGC GGTGTCGATC CCGATCGCGA TCTCGAATTG
CGCGTGGTAC CGCCGCCTCT GGTGGCACAG CAGCTCGAGG CCGGATGGCT GGACGGCTAT
TGCGTGGGCG AGCCCTGGAA TACGCTCGCG GCACGCCGCG AACGTGGGCG TGTGCTGATC
GGCAGCCACG CCATCTGGCA GAACGGCCAG GAGAAGGTCT TCGCGGTACG CGAAGAGTGG
GCCCAGGCGT ATCCGGCCAC GCACCGTGCC GTCCTGCGTG CCTTGTTGAA GGCATGCGCA
TGGCTCGAGG CTCCGGCGCA CCGAGGCGAG GCGGCACGCC TGATGTGCGA GCGGGGGTAT
CTGGATGTCG CGCCCAGCGT GGTCGAGGAC AGCATGCTGG CGGAGGGGGC GTCGCTGACG
CATGGCATGA GTGTCTTTCA TCGTCATGCC GCGAATTTCC CGTGGCGCTC CCATTTGCTC
TGGTACGCCG GGCAGATGCG GCGCTGGCAG CACCTGGACG TCGACGACGC GGTGCTTGAT
GAGTGCCTGT CCCGTTGTGT CCGGCCGGAC CTGTTTCGTG AGGCCGCCGA TGACCTGGGC
ATCAATTATC CGTTGGTGGA TGCCAAGAGC GAGGGCGAGC ATGGGGGCGC CTGGGTGATG
GAAGGGCGGC ATGGTCCGCT CGACATGGGC GGCGACCGGC TTCTGGGCGG TGCCCGATTC
GCACCTTCCC TGCATTCATG A
 
Protein sequence
MNDRQTPRDA WPSPELDTLT LGMLPLNDAA PLVVARERGF FAEQGLDVTL SVESSWAGIR 
DAMQLALLDG AQMLPLMPLA STLGLDGRRT PMLSALTLSL GGNAITVSNA LYDDMLAADP
DAMAASPTSA RALATVVRQR REKGAPPLRF ASVYPFSSHR YLLRYWLAEG GVDPDRDLEL
RVVPPPLVAQ QLEAGWLDGY CVGEPWNTLA ARRERGRVLI GSHAIWQNGQ EKVFAVREEW
AQAYPATHRA VLRALLKACA WLEAPAHRGE AARLMCERGY LDVAPSVVED SMLAEGASLT
HGMSVFHRHA ANFPWRSHLL WYAGQMRRWQ HLDVDDAVLD ECLSRCVRPD LFREAADDLG
INYPLVDAKS EGEHGGAWVM EGRHGPLDMG GDRLLGGARF APSLHS
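
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch under stated assumptions rather than the tool used to produce the record: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code, and it assumes the gene sequence is given 5' to 3' on the coding strand. The helper names `translate` and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, ignoring a trailing partial codon."""
    dna = clean(seq)
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and the final line of the gene sequence above.
print(translate("ATGAACGATC GCCAGACGCC TCGAGATGCG"))  # MNDRQTPRDA
print(translate("GCACCTTCCC TGCATTCATG A"))           # APSLHS*
```

The two fragments reproduce the first ten and last six residues of the protein sequence, followed by the stop codon. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.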