Gene Jann_0040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_0040 
SymbolnusA 
ID3932476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp38314 
End bp40086 
Gene Length1773 bp 
Protein Length590 aa 
Translation table11 
GC content60% 
IMG OID637902381 
Producttranscription elongation factor NusA 
Protein accessionYP_507982 
Protein GI89052531 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA
[TIGR01954] transcription termination factor NusA, C-terminal duplication 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.72674 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCATCA CGTCAGCCAA CCAGCTGGAG CTTTTGCAAA CCGCAGAAGC CGTTGCCCGC 
GAAAAGATGA TCGACCCCAA TCTGGTGGTC GAGGCGATGG AAGAAAGTCT CGCCCGGGCT
GCAAAGTCCC GCTATGGGTC CGAGCTGGAT ATCCGCGTTT CCATCGACCG CAAGACGGGC
CGCGCCACGT TCACCCGTGT GCGCACCGTG GCGGATGAAG ACACGCTGGA AAACGACAAG
GCCGAAATGC TGCTGGCCGA TGCCGACGCA ACGGTTGCCG CCGAGCGCCC CAATGTCGTG
GTCTTCAACG CCAAGTCCCA CGTCTATGAT GAGGAAGGCG AGGTTGTGGA GACCCGCGAC
GTGCGCCGTT TTGTCATGCG CAAGCCGACC GAGAATGACA AGCTCCTGAC CGAGGGCGTG
GATCAATCCA CCGGCCCCGT GATCGGTGAC ATGATCGCCG ACGAAGTGCC GCCCGTGGAA
ATGGGCCGGA TCGCCGCGCA ATCGGCCAAG CAGGTCATTT TGCAGAAGGT CCGCGAAGCC
GAGCGTGACC GTCAGTTCGC CGAGTTCAAG GACCGTGTGG GCGAGATCAT CAACGGCGTC
GTCAAGCGTG AGGAATACGG CAACGTCATC GTCGACGTAG GCTCCGGTGA GGCGCAGCTG
CGCCGCAACG AGAAGATCGG GCGGGAAGCC TATCGCAACG GCGACCGCAT TCGCTGCTAC
ATCAAGGATG TGCGCCGTGA GAACCGCGGC CATCAGATCT TCCTGTCGCG CACCGCGCCG
GAGTTCATGC GCGAGTTGTT CAAGATGGAA GTGCCCGAGA TCTATGACGG CATCATCGAG
ATCAAGGCCG TCGCCCGCGA TCCCGGGTCG CGTGCCAAGA TTGCGGTGAT TTCCTATGAC
AATGGCATCG ACCCCGTGGG CGCCTGTGTG GGTATGCGCG GCAGCCGGGT GCAGGCTGTC
GTCAACGAAT TGCAGGGCGA AAAGATCGAC ATCATTCCGT GGAATGAGGA TGCGCCGACG
TTCCTGGTGA ATGCGCTGCA ACCTGCCGAG GTCTCCAAGG TGGTTCTGGA TGAGGACGCC
GGGAAGATCG AAGTCGTGGT GCCCGATGAG CAGCTGAGCC TTGCCATTGG TCGGCGCGGT
CAGAACGTGC GTCTTGCCTC ACAGCTGACC AACCTTGATA TCGATATCCT GACCGAGGAA
GAGGAATCCA AGCGTCGTCA GGCCGAGTTT GAAGAGCGCA CGAAGCTGTT CATGGATACG
CTCGACCTGG ATGAGTTCTT CGCGCAGCTT CTGGTCTCCG AAGGCTTCAC GGCGTTGGAA
GAAGTGGCCT ATGTGGAAGC GGACGAGCTT CTGGTCATCG ACGGCGTTGA TGAGGGTACT
GCGGAAGAAT TGCAGGCCCG CGCCCGCGAT TATTTGGAGG CGCAGAATAA ACTGGCGCTT
GAGAAGGCGA AAGAGATGGG CGTTGAAGAG AGCCTCATTG CATTTGAGGG CCTTACTCCC
CAAATGTTGG TGGCCTTGGG CGAAGACGGC GTGAAAACGC TGGAAGACTT CGCTACCTGC
GCAGATTGGG AACTGGCCGG CGGTTGGACA ACCGAAGGCG GAGAGCGCAT CAAGGACGAC
GGCCTTTTGG AGCCCTTCGA AGTCTCTTTG GAAGAAGCAC AGAACATGGT GATGACCGCA
CGCCTGCAAC TGGGTTGGGT GACGATCGAG GAATTGGAAG CCGACGCCGC TGCCGAAGCA
GAAGCCGCCG CACAGGAGGG GGCAGAGGGC TAA
 
Protein sequence
MAITSANQLE LLQTAEAVAR EKMIDPNLVV EAMEESLARA AKSRYGSELD IRVSIDRKTG 
RATFTRVRTV ADEDTLENDK AEMLLADADA TVAAERPNVV VFNAKSHVYD EEGEVVETRD
VRRFVMRKPT ENDKLLTEGV DQSTGPVIGD MIADEVPPVE MGRIAAQSAK QVILQKVREA
ERDRQFAEFK DRVGEIINGV VKREEYGNVI VDVGSGEAQL RRNEKIGREA YRNGDRIRCY
IKDVRRENRG HQIFLSRTAP EFMRELFKME VPEIYDGIIE IKAVARDPGS RAKIAVISYD
NGIDPVGACV GMRGSRVQAV VNELQGEKID IIPWNEDAPT FLVNALQPAE VSKVVLDEDA
GKIEVVVPDE QLSLAIGRRG QNVRLASQLT NLDIDILTEE EESKRRQAEF EERTKLFMDT
LDLDEFFAQL LVSEGFTALE EVAYVEADEL LVIDGVDEGT AEELQARARD YLEAQNKLAL
EKAKEMGVEE SLIAFEGLTP QMLVALGEDG VKTLEDFATC ADWELAGGWT TEGGERIKDD
GLLEPFEVSL EEAQNMVMTA RLQLGWVTIE ELEADAAAEA EAAAQEGAEG