Gene Francci3_2999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2999 
Symbol 
ID3905496 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3552610 
End bp3554121 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content73% 
IMG OID637880319 
Producthypothetical protein 
Protein accessionYP_482085 
Protein GI86741685 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.280207 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.727721 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGAGG CGCTTCTGTT GGTACTCAGC CTGGTGCTCG TGGCCGCCTG CGGCGTTTTC 
GTCGCGGCCG AGTTCGCTTT CGTCACGGTG GACCGGCCCT CGGTGGAGCG GGCCGCCGAA
CGCGGTGATC GGGGCGCCCG CGGGGTGCTC ACCGCGCTGC GCGGTCTGTC CACCCAGCTG
TCCGGCGCCC AGCTCGGGAT CACCGTCACC AACCTGATGA TCGGTTTTCT GGCGGAGCCC
GCCATCGCAC GGCTGCTCGA GGGGCCGATC ACCTCGCTCG GCGCCTCGGA CGGGCTCGCC
CGGGCGGCGT CGGTGGCGAT CGCTCTGGTC CTCGCGACGG GCCTGACGAT GGTGTACGGC
GAGCTGCTCC CGAAGAACCT GGCGATCGCC CATCCCCTCG GCACCGCGTT GGCGGTGCAG
GCTCCCCAGC GTGCCTTCAC GAAGGCCACC GGCCTGCTGA TCCGCTCGCT GAACATCACG
GCGAACGCGG TGCTGCACCG CCTCGGCATC TCGCCACGCG AGGAGCTGGC CTCGGCCCGT
TCGGCTCAGG AGCTGTTCTC CCTCGTCGGG CGGTCCGCCG AGCACGGCAC CCTCTCCCAC
GAGACGGCGA CGCTGGTGCA GCGCTCCCTG CTGTTCGGTG ACCGGACCGC CGAGGACGTC
ATGACACCCC GGATGCGCAT GCGCACCATC CACGCCGACG AACCGGTCAG CGAGGTCATC
ACCCTCACCC GGCGCACCGG GCACTCCCGC TTCCCGGTAC TCGGCACGGA CAGCGACGAC
GTCGTGGGCC TCATCCACGT GAAGAACGCG GTCGCCGTCC CCGAAGACGC CCGGGACCAT
ACCCCGGTAC GCGACGTGAT GGTCCCGCCG GTGACCGTGC CCTCGACGAT CCTGCTCGAC
CCCCTGCTGG AGACACTGCG TGCCGGCGGC ATGCAGATAG CGATCGTGGT TGACGAGTTC
GGCGGCACCG ACGGGCTGGT CACCGCCGAG GACCTCATCG AGGAGATCGT CGGCGACGTC
GTCGACGAAC ACGACCGGGT CAGCCCGCGC GCCCTGCGCC GGCGGGACGG CAGCTGGCTG
CTCTCCGGCC TGCTGCGCCC GGAGGAGGCC CGCAACGTCA CGGGGATCGA CATCCCCGCG
GACGACACCT ACCAGACCCT GGGCGGTCTG ATGGCCCGGG CGCTGGGACG CATCCCCCGG
GCGGGCGACA CGGCGGCCGT GGAGGGCGTG CGGTACACCG TCGAGCGGAT GGACGGCCGG
CGGGTCGACC GGATCCGTCT GGGCCCGATC GCACCGACGG ACGCCACGCC GGAGACGGAT
GCGGATCCAC CGGTCGCGGA CCCCGCGGAT CGCCGCCCCG GCGCAGACCC GGCAGACCCC
GCAGACCCCG CAGACCCCGC AGACCCCGCA GACCCCGCAG ACCCGGCAGA CCCGGCAGAC
CCGGCAGACC CGGCCGCCGA GGGGGCCGAG GAGGCCGAGA CCGCCTCGGC CGGAAGGGCG
GGGTGGGGAT GA
 
Protein sequence
MIEALLLVLS LVLVAACGVF VAAEFAFVTV DRPSVERAAE RGDRGARGVL TALRGLSTQL 
SGAQLGITVT NLMIGFLAEP AIARLLEGPI TSLGASDGLA RAASVAIALV LATGLTMVYG
ELLPKNLAIA HPLGTALAVQ APQRAFTKAT GLLIRSLNIT ANAVLHRLGI SPREELASAR
SAQELFSLVG RSAEHGTLSH ETATLVQRSL LFGDRTAEDV MTPRMRMRTI HADEPVSEVI
TLTRRTGHSR FPVLGTDSDD VVGLIHVKNA VAVPEDARDH TPVRDVMVPP VTVPSTILLD
PLLETLRAGG MQIAIVVDEF GGTDGLVTAE DLIEEIVGDV VDEHDRVSPR ALRRRDGSWL
LSGLLRPEEA RNVTGIDIPA DDTYQTLGGL MARALGRIPR AGDTAAVEGV RYTVERMDGR
RVDRIRLGPI APTDATPETD ADPPVADPAD RRPGADPADP ADPADPADPA DPADPADPAD
PADPAAEGAE EAETASAGRA GWG