Gene Franean1_1180 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1180 
Symbol 
ID5669593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1403454 
End bp1404539 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content72% 
IMG OID641240112 
ProductNusA antitermination factor 
Protein accessionYP_001505540 
Protein GI158313032 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00193855 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.133993 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGCTCG ACGTCGCTGC GCTGCGGGGA ATCGAGCGCG AGAAGGAGAT CGCCTTCGAC 
ACGCTCGTCG AGGCGATCGA GACCGCGTTG TTGACCGCCT ACAAGCACAC CACCGGATCC
GCCGACGACG CTCGGGTGGT GATCGACCGG ACCAGCGGCG AGGTCGCGGT CTTTGCGCGG
GAGAGCGGCC CGGACGGGAC GTCCCGCGAG TGGGACGACA CCCCGGCGGA CTTCGGTCGG
ATCGCCACGA TGACCGCCAA GCAGGTCATC ATGCAGCGCC TGCGCGAGGC CCAGCAGGAG
GTCACCTACG GCCAGTACGC CGACCGCGAG CACGAGGTGG TCTCGGGCGT GGTCCAACAT
CACGAGCAGC GGGCCGGTTC CCGCGTCGTG CTGGTCGATC TCGGCACGGT GGAGGGCGTG
CTGCCCCCGG CCGAGCAGGT TCCCGGCGAG CGCCTGGAGC ACGGTGACCG GATCAAGTGC
TATGTGGTGC ACGTGGCCCG CGGGATGCAC GGCCCGACGG TCACCCTCTC GCGGACCCAT
CCCGAGCTGG TGAAGGGCCT GTTCCGGCTG GAGGTGCCCG AGGTCGCCGA CGGCACGGTC
GAACTCGCCG CGATCGCCCG CGAGGCCGGT CACCGCACGA AGATCGCGGT GCGTTCGAAG
GCGGCCGGGG TGAACCCGAA GGGCGCCTGC ATCGGCCCGA TGGGCAGCCG GGTGCGCGCC
GTGATGGCGG AGCTGCACGG CGAGAAGATC GACATCGTCG ACTGGTCGGC GGATCCCGCG
TCCTTCGTGG GCAGCGCGCT CTCGCCGGCC AGGGTGTCCC GGGTGGAGGT CACCGACCTG
GCGAGTCGTT CGGCACGGGT GGTCGTTCCC GACTACCAGC TCTCGCTCGC GATCGGCCGG
GAGGGGCAGA ACGCCCGGCT GGCCGCCCGG CTCACCGGAT GGCGGATCGA CATCCACTCC
GACACCGAGG GTAGCGAGCC GCGCGCGGAG CGGCCGGCCG GGGAGGCCCC CCGTCGTCCG
GGGACGGGGA CCGGGCCCCG GAGGTCCTCC GCAACGGGCG GCCACTCTCG GGGCGCAACG
GGATAG
 
Protein sequence
MKLDVAALRG IEREKEIAFD TLVEAIETAL LTAYKHTTGS ADDARVVIDR TSGEVAVFAR 
ESGPDGTSRE WDDTPADFGR IATMTAKQVI MQRLREAQQE VTYGQYADRE HEVVSGVVQH
HEQRAGSRVV LVDLGTVEGV LPPAEQVPGE RLEHGDRIKC YVVHVARGMH GPTVTLSRTH
PELVKGLFRL EVPEVADGTV ELAAIAREAG HRTKIAVRSK AAGVNPKGAC IGPMGSRVRA
VMAELHGEKI DIVDWSADPA SFVGSALSPA RVSRVEVTDL ASRSARVVVP DYQLSLAIGR
EGQNARLAAR LTGWRIDIHS DTEGSEPRAE RPAGEAPRRP GTGTGPRRSS ATGGHSRGAT
G