Gene Francci3_4199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4199 
Symbol 
ID3907164 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5013072 
End bp5014646 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content71% 
IMG OID637881527 
Producthypothetical protein 
Protein accessionYP_483276 
Protein GI86742876 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCTCA TCCGTTCCCG TGAGGTGTGG GAGACGCACA TGGACCCTCA TCCCGACCAG 
CTCCTGCGCG ACCGTGCGGG CGGACCGATA CCGGACGACG TCGAGACCAT GGCCCGCGAT
GTCATGTGGT ACGCCTACGC GCTGGGTGAC GACCGGCCGG CGGTGCCAGG CGTTTACGGC
GAGCAGGCAG GCTTCGCGGC CCCCGACGCC CCCGAGGAGG TCGCACCGCG GCTGCCGGTC
ACCGAGCCCG GCGCACCGGC CACCCCGCTT CCCCTGGGCA CGGTGGAACT GCTGGAGCGC
ATGCCCTACC CGTGCGGCAG GGCACCCGAG GCCGGCACAT CCGCCGAAGC CCTCGGGCAC
GCCTTGGTCA CCGCCTTCGG CCCGCACCGG CGCGAGCCGG AGAATCCGAT CAACGACCAC
CGCGGCTATG CGTCCGTGCG CAGCAAGTTC CCCGTGCACG CCTTCGTAGC CGACCGCGAA
CGACGGCAGG TCCTGGATGT CCACCGCAAC GCGCTGGTGG ACCTCGCCGG CGCCAACCCC
CCGCAGGACC CCTCGGCCGA CGGCTGCGAG ATCCTTCTGA TCGGGCGGTA CACGCGGCTG
CCCGCGAGCT ACCGGTGGTT CCGCGGCGCG CTGGTCCACC TGGAGATGGG CATCGCACTG
CGGTCCCTGT GCCTGGCGTT GGAGTTGTTC GGCCTGTCCG GCCGGCTCCG GCTGCCGAAC
AGCGGCCAGG GGCGGCTGCG CGAGTTCGGC CTGCGCCCTG CCTGGGAGTG GACGCTGCCG
CTGACCGTCG AACTGGCCGA ACGACCGGCA CCGGTGCAGA CCACCGGGCG GGGCGCCGAT
ACCGTTCCCG ACCCCGTGCT CGCCGACGTC CTCGCCATGA ACCGGGCCCA GACCTTCACC
GAAGCGCCGG CCACCCTGGG ACCCGCGGCG CCCAGCCAGG ATGACGTGAC CGCGGCAGCG
GAACGGTCCT GGGCCGAACT GCTGTGGTCA CGCACGTCCG GACGTATGCC CCGAGGCCTG
TATGGCATGA ACGGTCGGCG GCGGGTACTC CCGGCTGCCG TGCTCCAGGA CGCGGTGCGC
TGGCTGGGCG TGCCCCCGCC CGGAGACGTC CTGCGGGGCG TGTCCAACGC GATCAGAACG
ACCGTCGTGA TCCAGGGCAT CGAGGGCCAC CGCGACGGGG TGTACGAGGT CGCGGACGGC
CACACCGTGC TGAAAGCGGC GGACACGACG GCTGCGGCCC GTCTCGAAGA GGTCTACGGC
TACACTCTGG CACCGGGCAA CGGTTGCGAC GTTCGGCACG CCTCGATGAT CTGGTTTCTC
ACGGTCGACC CCCGAGTCGT GGTCGACCGC TACGGTCCGG GCGGCTGGAC CACGATCCAG
TACGTGTGCG GCTGGGCGGC CCACGGACTG ACGCTCGCGG CAGCCGGGTC GGGCCTGTAC
GCCAGGCCCG TACGGGCCTT TTATGAGGCA GCGGCACGGC GCGTCCTGGG TCTCGGTTCC
GAGGAGATGG TCGTGCTGGC GGTGATCGGC GGAACTCCCG GATACCGGGG CCTGATGCTC
GACCTGCGAA GCTGA
 
Protein sequence
MTLIRSREVW ETHMDPHPDQ LLRDRAGGPI PDDVETMARD VMWYAYALGD DRPAVPGVYG 
EQAGFAAPDA PEEVAPRLPV TEPGAPATPL PLGTVELLER MPYPCGRAPE AGTSAEALGH
ALVTAFGPHR REPENPINDH RGYASVRSKF PVHAFVADRE RRQVLDVHRN ALVDLAGANP
PQDPSADGCE ILLIGRYTRL PASYRWFRGA LVHLEMGIAL RSLCLALELF GLSGRLRLPN
SGQGRLREFG LRPAWEWTLP LTVELAERPA PVQTTGRGAD TVPDPVLADV LAMNRAQTFT
EAPATLGPAA PSQDDVTAAA ERSWAELLWS RTSGRMPRGL YGMNGRRRVL PAAVLQDAVR
WLGVPPPGDV LRGVSNAIRT TVVIQGIEGH RDGVYEVADG HTVLKAADTT AAARLEEVYG
YTLAPGNGCD VRHASMIWFL TVDPRVVVDR YGPGGWTTIQ YVCGWAAHGL TLAAAGSGLY
ARPVRAFYEA AARRVLGLGS EEMVVLAVIG GTPGYRGLML DLRS