Gene Francci3_3007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3007 
Symbol 
ID3905504 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3567506 
End bp3568780 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content72% 
IMG OID637880327 
Producthypothetical protein 
Protein accessionYP_482093 
Protein GI86741693 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.153123 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.294126 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTCTC GGTCCATGAC GTTTTCCGCT TCGGCCTCGT CCCCGCCGAC CCGTCCCGTG 
CGCTATCCCG GTTTCGACCC CGCGGTCCGC ATCGCCGGTC GCCAGGGGGG TGCGATCACC
TACCGGCAGG CGATGGCCGC TGGGTTGACC CGCGGCCAGC TGCGCCAGCT CGTCCACTCT
GGCCAGTGGA GCCATCCGGT GCGCGGTGTC TTCGTCGTAC CTCTCGGCCC GACGGAGCTC
CCCGGCTCCG TCAACGACGT CCCGCGGCCG GGGGACCGGC GGCGCGCCGA GCATTCTGAA
ATCCGGGAGA AGGAATCCGC CGGACTTGTC CGGAACGACG TCGCGCGCCG ACGAAGAAAG
CGCCATGGTG CCATCACCGC GGGCCCCGGA CACGCGGCGT CGGTCGTCGG CGGTGAAAAT
ACAACGCCGA TTCTCCCGGT CTTCTTCTCG CCATTTTCGG CACGGGTAAG GGCCGCGCTC
ATCGGGCGTC CCCGAGCCGT CGTCTGCGGG ATCACCGCCG CCCGGCTCCA CGGTTTTCCG
CTCGAGGTTC CGGAAAGCTC CGCCGAACCC GTGCACCTTC TTCTTCCGGC TCGGCAGACC
CGGGCCCAGC CGCGCGGGAT CCGGCTGCAC TTCAGCGATC TCGACGTCGA CCAGCGCGTC
GAGTTGGGTG GGATCCCACT CACCTCGCCG GAGCGGACGC TGGCGGATCT CGTGCTGGCC
GCCCAGTCGC GTGAGGTGGC GGTCGCCCAC CTGGACGCAG CCCTGCACCG CGGCCTGGTG
CCGAGCCTGG CGGGGGCACG GGCGGCCGCC GAGGGGCGGC GCGGTTTCCG GCAGACGACC
GACTGGTGGT CGCTGGCCGA CGGCCGGGCC GAGACCCCCC TGGAGACCCG GCTGCGGCTG
CTGTTGGCCG ACAACGGGCT CGCGCCCGTG GAATTGCAGT GGCCGGTCAT GGACGGGACC
GGTCAGATCA TCACCCGGCT GGATCTCGCC TGGCCGGCGC AGCGACTCGA CGTGGAGGCG
GACACCTTCT CGGCCACCAG CCCGCCGGCG ATGATCTACC AGGACCGTCA TCGCGGCAAC
ATCCTCGCCG CGTTGCGCTG GACGGTGCTG CGCTTCAGCG TGGCCGACGT GACCTGGTAT
CCGGAGCGGG TCGTCTCGGC GGTGACCCGG GTCCTGGCGG CGCGGGCGGC CGAGCGGGCC
GAGGCGTCCA GAGCGGTCGC TGAGGCGTCC GCCGCCGGCA CAGCCGCGGT GGCTGAGCGG
CTGTGGGCAT CGTGA
 
Protein sequence
MDSRSMTFSA SASSPPTRPV RYPGFDPAVR IAGRQGGAIT YRQAMAAGLT RGQLRQLVHS 
GQWSHPVRGV FVVPLGPTEL PGSVNDVPRP GDRRRAEHSE IREKESAGLV RNDVARRRRK
RHGAITAGPG HAASVVGGEN TTPILPVFFS PFSARVRAAL IGRPRAVVCG ITAARLHGFP
LEVPESSAEP VHLLLPARQT RAQPRGIRLH FSDLDVDQRV ELGGIPLTSP ERTLADLVLA
AQSREVAVAH LDAALHRGLV PSLAGARAAA EGRRGFRQTT DWWSLADGRA ETPLETRLRL
LLADNGLAPV ELQWPVMDGT GQIITRLDLA WPAQRLDVEA DTFSATSPPA MIYQDRHRGN
ILAALRWTVL RFSVADVTWY PERVVSAVTR VLAARAAERA EASRAVAEAS AAGTAAVAER
LWAS