Gene Francci3_3564 details

Gene Information

Locus tag: Francci3_3564
Symbol:
ID: 3904503
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4262588
End bp: 4263685
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 70%
IMG OID: 637880885
Product: NusA antitermination factor
Protein accession: YP_482645
Protein GI: 86742245
COG category: [K] Transcription
COG ID: [COG0195] Transcription elongation factor
TIGRFAM ID: [TIGR01953] transcription termination factor NusA
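
The start, end, gene length, and protein length fields can be cross-checked against each other. The sketch below assumes the coordinates are 1-based and inclusive (the record does not state its convention) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Assumption: Start bp / End bp are 1-based, inclusive coordinates on NC_007777.
start_bp = 4262588
end_bp = 4263685

gene_length = end_bp - start_bp + 1   # inclusive span in base pairs
codons = gene_length // 3             # number of complete codons
protein_length = codons - 1           # last codon is assumed to be the stop

print(gene_length, protein_length)    # 1098 365
```

Under those assumptions the result agrees with the listed 1098 bp gene length and 365 aa protein length.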


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0133345
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAGCTCG ACGTGGCCGC GCTGCGCGGA ATCGAGCGGG AGAAGGACAT TGCCTTTGAC 
ACCTTGGTGC AGGCGATGGA GACGGCCCTG CTGACGGCTT ATCACCACAC GGCCGGGTCC
GCGCAGGACG CCCGGGTGGT GATCGACCGG ACGACCGGAG ATGTCTCCGT CCTTGCCCGG
GAACAGGGCC CGGACGGCAC CAGCCGGGAG TACGACGACA CCCCGGCGGA CTTCGGGCGG
ATCGCGACCA TGACCGCCAA ACAGGTGATC ATGCAGCGGC TGCGCGAGGC CCAGCAGGAG
GTCACCTACG GCCAGTACGC CGACCGGGAG CACGAGATCG TTTCCGGTGT GGTGCAACAT
CACGAACAGC GGGCCGGCTC CAGGGTCGTG CTCGTCAATC TCGGCACCGT CGAGGGTGTC
CTGCCGCCGG CCGAGCAGGT CCCCGGCGAG CGGCTTGAGC ACGGCGACCG AATCAAGTGT
TATGTGGTGC ACGTCGCGCG GGGGCCGCAC GGACCCACGG TGACGTTGTC CCGCACCCAT
CCCGAGCTGG TGAAGGGGCT GTTCCGGCTG GAGGTGCCCG AGGTCGCCGA CGGCACCGTG
GAACTCGCCG CGATCGCCCG CGAGGCCGGG CATCGCAGCA AGATAGCGGT GCGGTCCCGG
GTGGCCGGGG TCAACCCCAA GGGCGCGTGC ATCGGGCCGA TGGGCAGCCG GGTGCGTGCC
GTCATGGCCG AACTGCGGGG CGAGAAGATC GATATCGTTG ACTGGTCGGC CGACCCGGCA
ACCTTCGTCG GGAGCGCGCT GTCTCCGGCC CGGGTGGCCC GGGTCGAGGT GACGGATCCG
GCGAGCCGCT CGGCGCGGGT GGTGGTGCCC GATTACCAGC TGTCGTTGGC CATCGGGCGT
GAGGGGCAGA ACGCCCGCCT CGCCGCCCGG CTGACGGGGT GGCGCATCGA CATCCACTCT
GACACCGAGG ACTCCGGCGA GGGCGGTGGG AGCGGATCCG AACGGGACTC CGCTGGAGCG
CCTCGACGAT CGGGACCAGC TACGGTGCCG CGGCGATCAC CTGCCGCGGG TGGCCATTCT
CGGGGCCAGG CGGGATAG
 
Protein sequence
MKLDVAALRG IEREKDIAFD TLVQAMETAL LTAYHHTAGS AQDARVVIDR TTGDVSVLAR 
EQGPDGTSRE YDDTPADFGR IATMTAKQVI MQRLREAQQE VTYGQYADRE HEIVSGVVQH
HEQRAGSRVV LVNLGTVEGV LPPAEQVPGE RLEHGDRIKC YVVHVARGPH GPTVTLSRTH
PELVKGLFRL EVPEVADGTV ELAAIAREAG HRSKIAVRSR VAGVNPKGAC IGPMGSRVRA
VMAELRGEKI DIVDWSADPA TFVGSALSPA RVARVEVTDP ASRSARVVVP DYQLSLAIGR
EGQNARLAAR LTGWRIDIHS DTEDSGEGGG SGSERDSAGA PRRSGPATVP RRSPAAGGHS
RGQAG
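
The gene and protein sequences are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch in plain Python, not the method the database itself uses. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the code assumes the sequence is given in the coding orientation, as it appears in this record.

```python
# Codon table in NCBI order (bases T, C, A, G). The amino acid assignments
# are identical for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons permitted by table 11; as an initiator each is read as Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    codons = [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]
    protein = []
    for index, codon in enumerate(codons):
        aa = "M" if index == 0 and codon in START_CODONS else CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "GTGAAGCTCG ACGTGGCCGC GCTGCGCGGA ATCGAGCGGG AGAAGGACAT TGCCTTTGAC"
print(translate(first_line))  # MKLDVAALRGIEREKDIAFD
```

The GTG initiator is read as methionine, which is why the protein begins with M rather than V. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the listed GC content.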