Gene CPR_1661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1661 
SymbolnusA 
ID4204988 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1853943 
End bp1855043 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content30% 
IMG OID642566211 
Producttranscription elongation factor NusA 
Protein accessionYP_698976 
Protein GI110803140 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGAAG AATTTCTAGG GGCATTATCT GAAATAGTAA AAGAGAAAGG TATTTCAGTA 
GAGGCTTTAT TAGAAACTAT AGATGATGCT ATAATAGCAG CTTATAAGAA AAACTTTTCA
AATTCAGGAA CAACTGCACA AAATGTAAAG GTAAAACGTG ATGAGAAATC AGGAGAAATC
CATGTTTATG CTCAAAAAGT TGTTGTTGAA GAAGTTTATG ATGATGTAAC AGAAATAAGT
TTAGAAGATG CAAAGGCTAT TAGTGCTATT TATCAATTAG ATGATATAGT AGAAATAGAA
GTAACACCTA AAAACTTTGG AAGAGTTGCA GCACAACTTG CTAAGCAAAT GGTTATCCAA
AGAATAAAAG AATCTGAGAG AAATGTAATT TACTCAGAAT TTGCAGAAAA AGAATTTGAC
ATATTACCTG GTACAGTAAT AAGAAAAGAT AAAGGTAATG TATTTGTAGA CTTAGGAAAA
ATAGAAGGTG TTTTAGGACC AAATGAACAA ATGCCTACAG AAAAATATAA CTTCAATGAA
AAATTACAAT TATACGTAGT TGAAGTTAAG AAAACATCTA AAGGTGCTTC AGTTTTATGT
TCAAGAACAC ATCCAGGTTT AGTTAAAAGA TTATTTGAAT TAGAAGTTCC AGAAATATAT
GAAGGAATAG TTGAAATAAA AAGTATAGCT AGAGAAGCAG GATCAAGAAC TAAAATAGCT
GTTTACTCAA ATGATGAATC AGTAGATGCT ATGGGAGCTT GTGTTGGACC TAAGGGTGTT
AGAGTTCAAA ATATAGTTAA TGAACTAAAA AATGAAAAAA TTGATATAAT AAAATGGAGT
AATACTCCAT CTGAGTATAT AGAAAATGCT TTAAGCCCAG CTAAGGTTGT AAGTGTAGAA
GCTGATGAAG AAACTAAATC AGCTAAGGTT ATAGTTGATG ATAGTCAATT ATCATTAGCT
ATAGGTAAAG AAGGACAAAA TGTTAGATTA GCAGCTAAAT TAACTGGTTG GAAAATAGAC
ATAAAGAGCA AATCTAAAGC AGAAGAATTA TTACAAGAAG AAGATATTGT TGTTAAAGAA
GACACTATAA TAGAAGAATA A
 
Protein sequence
MNEEFLGALS EIVKEKGISV EALLETIDDA IIAAYKKNFS NSGTTAQNVK VKRDEKSGEI 
HVYAQKVVVE EVYDDVTEIS LEDAKAISAI YQLDDIVEIE VTPKNFGRVA AQLAKQMVIQ
RIKESERNVI YSEFAEKEFD ILPGTVIRKD KGNVFVDLGK IEGVLGPNEQ MPTEKYNFNE
KLQLYVVEVK KTSKGASVLC SRTHPGLVKR LFELEVPEIY EGIVEIKSIA REAGSRTKIA
VYSNDESVDA MGACVGPKGV RVQNIVNELK NEKIDIIKWS NTPSEYIENA LSPAKVVSVE
ADEETKSAKV IVDDSQLSLA IGKEGQNVRL AAKLTGWKID IKSKSKAEEL LQEEDIVVKE
DTIIEE