Gene CPR_2259 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2259 
Symbol 
ID4204129 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2477588 
End bp2478880 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content34% 
IMG OID642566811 
Productnitrite extrusion protein, putative 
Protein accessionYP_699535 
Protein GI110801554 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2223] Nitrate/nitrite transporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAACT TAAACCTTAA GGGTAATCCA AATCGTGGAT TAATAGGAGC TACATTAGGT 
TTCTTCATAG GTTTTGCAGC AGTTTCTCTT TATGGACCAA CTTCTGCAGT ATTTAAAGAA
GCTTTCGTAA ACCTTAATCC AGTATTATTA GCATTACTTA TCGCAGCCCC TAACTTATCA
GGTTCTTTAC TTAGAATACC TTTTTCAGCA TGGGTTGACA CAACTGGTGG AAGAAAGCCA
CTTATAGTTT TACTACTTTT ATCAATAATA GGTATGGGTG GTTTATACGT AGTATTAGCT
TTCTTTAGCG ATAACTTAAA TCAATATTAT TATTTATTAT TTGCACTTGG ACTATTATCT
GGATGTGGTA TAGCAACATT CTCTGTTGGT GTCAGCCAAG CATCATATTG GTTCCCAAAA
AGCAAACAAG GTGTAGCTTT AGGAATATAT GGTGGTGTTG GTAACTTAGC ACCTGGAATA
TTTGCATTAT TAATTCCAAA CTTAGCACTT CCTCTATTAG GATTACCAGG TTCTTACTTA
GCTTGGTTAA TATTCTTAAT AGTTGGTACA GTAATTTATA TAAAAATAGG TCAAAACGCT
TGGTACTTCC AACTTGTTGA CAAAGGAATA AATAAAGATG AAGCTAAGGA AATAGCATCA
AAAGATTACG GTCAAGAGTT ATTCCCTAAG GGAAAAGCAA GTGAAACACT TTTAATATCA
GCTAAGTCAT GGAAAACTTG GGCTTTAGTA TTTATATATT TCACAACTTT CGGTGGATTC
TTAGCATTAA CAGGTTGGTT CCCTAACTAT TGGATGACTT ATTTCGGATT AAATATGAAA
GTTGCAGGTC TTTTAACAGC TTTATATTCT ATATTAACTT CATTAACTAG AATATATGGT
GGAAAAGTTG CTGATAAAAA TGGCGGAGAA ATAACTACTA TAGTTTCTCT AGGAATAGCA
CTTATCGGTG CAATTTGTAT GACTTTTGCT TCAACTATGA CTTTAGCAAT AATAGGTATA
ATCTTACTTG CTATTGGTAT GGGAGTTGCA AACGCAGCAG TATTTAAAAT AGTTCCTAAT
GCAGTTCCAG AGGCAATGGG TGGAGCTTCA GGATGGATCG GTGGATTAGG TGCTTTAGGT
GGATTCTTAA TACCTCCTGT AATGGCATCA TTCCTAAACA GATCAGGATT TGCTGGATAT
TCACAAGGAT TTTCAGTATT TATAGTTTTA ATAATTATAG CAATTGGAGT AATAGCTTTA
TTCCAAGCTT TACAAAAGAA ATCAGCAAAA TAA
 
Protein sequence
MKNLNLKGNP NRGLIGATLG FFIGFAAVSL YGPTSAVFKE AFVNLNPVLL ALLIAAPNLS 
GSLLRIPFSA WVDTTGGRKP LIVLLLLSII GMGGLYVVLA FFSDNLNQYY YLLFALGLLS
GCGIATFSVG VSQASYWFPK SKQGVALGIY GGVGNLAPGI FALLIPNLAL PLLGLPGSYL
AWLIFLIVGT VIYIKIGQNA WYFQLVDKGI NKDEAKEIAS KDYGQELFPK GKASETLLIS
AKSWKTWALV FIYFTTFGGF LALTGWFPNY WMTYFGLNMK VAGLLTALYS ILTSLTRIYG
GKVADKNGGE ITTIVSLGIA LIGAICMTFA STMTLAIIGI ILLAIGMGVA NAAVFKIVPN
AVPEAMGGAS GWIGGLGALG GFLIPPVMAS FLNRSGFAGY SQGFSVFIVL IIIAIGVIAL
FQALQKKSAK