Gene CPF_1943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1943 
SymbolnusA 
ID4201975 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2179895 
End bp2180995 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content30% 
IMG OID638082812 
Producttranscription elongation factor NusA 
Protein accessionYP_696376 
Protein GI110800366 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGAAG AATTTCTAGG GGCATTATCT GAAATAGTAA AAGAGAAAGG TATTTCAGTA 
GAGGCTTTAT TAGAAACTAT AGATGATGCT ATAATAGCAG CTTATAAGAA AAACTTTTCA
AATTCAGGAA CAACTGCACA AAATGTAAAG GTAAAACGTG ATGAGAAATC AGGAGAAATC
CATGTTTATG CTCAAAAAGT TGTTGTTGAA GAAGTTTATG ATGATGTAAC AGAAATAAGT
TTAGAAGATG CAAAGGCTAT TAGTGCTATT TATCAATTGG ATGACATAGT AGAAATAGAA
GTAACACCTA AAAACTTTGG AAGAGTTGCA GCACAACTTG CTAAGCAAAT GGTTATCCAA
AGAATAAAAG AATCTGAGAG AAATGTAATT TACTCAGAAT TTGCAGAAAA AGAATTTGAC
ATATTACCTG GTACAGTAAT AAGAAAAGAC AAAGGTAATG TATTTGTAGA CTTAGGAAAA
ATAGAAGGTG TTTTAGGACC AAATGAACAA ATGCCTACAG AAAAATATAA CTTCAATGAA
AAATTACAAT TATACGTAGT TGAAGTTAAG AAAACATCTA AAGGTGCTTC AGTTTTATGT
TCAAGAACAC ATCCAGGTTT AGTTAAAAGA TTATTTGAAT TAGAAGTTCC AGAAATATAT
GAAGGAATAG TTGAAATAAA AAGTATAGCT AGAGAAGCTG GATCAAGAAC TAAAATAGCT
GTTTACTCAA ATGATGAATC AGTAGATGCT ATGGGAGCTT GTGTTGGACC TAAGGGTGTT
AGAGTTCAAA ATATAGTTAA TGAACTAAAA AATGAAAAAA TTGATATAAT AAAATGGAGC
AATACTCCAT CTGAGTATAT AGAAAATGCT TTGAGCCCAG CTAAGGTTAT AAGTGTAGAA
GCTGATGAAG AAACTAAATC AGCTAAGGTT ATAGTGGATG ATAGTCAATT ATCATTAGCT
ATAGGTAAAG AAGGACAAAA CGTTAGATTA GCAGCTAAAT TAACTGGTTG GAAAATAGAC
ATAAAGAGCA AATCTAAAGC AGAAGAATTA TTACAAGAAG AAGATATTGT TGTTGAAGAA
GACACTATAA TAGAAGAATA A
 
Protein sequence
MNEEFLGALS EIVKEKGISV EALLETIDDA IIAAYKKNFS NSGTTAQNVK VKRDEKSGEI 
HVYAQKVVVE EVYDDVTEIS LEDAKAISAI YQLDDIVEIE VTPKNFGRVA AQLAKQMVIQ
RIKESERNVI YSEFAEKEFD ILPGTVIRKD KGNVFVDLGK IEGVLGPNEQ MPTEKYNFNE
KLQLYVVEVK KTSKGASVLC SRTHPGLVKR LFELEVPEIY EGIVEIKSIA REAGSRTKIA
VYSNDESVDA MGACVGPKGV RVQNIVNELK NEKIDIIKWS NTPSEYIENA LSPAKVISVE
ADEETKSAKV IVDDSQLSLA IGKEGQNVRL AAKLTGWKID IKSKSKAEEL LQEEDIVVEE
DTIIEE