Gene Francci3_3337 details


Gene Information

Locus tag: Francci3_3337
Symbol:
ID: 3904123
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3954344
End bp: 3955492
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 73%
IMG OID: 637880662
Product: hypothetical protein
Protein accession: YP_482423
Protein GI: 86742023
COG category:
COG ID:
TIGRFAM ID:
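
The gene and protein lengths in this record follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; both assumptions are consistent with the numbers listed here but are not stated by the record itself. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 3954344
END_BP = 3955492


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1149 382
```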


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0204468
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCTGA CCGCCGAGAA GCCCCCGACC GCCCCGCCAC CCTCGACCGG AACGATCACA 
CCGCTGGCGG GCCCGCCAGC ACCACGACCC GCCGCGCCGC CACCCACCAG CCCGACCCGC
GGGAATTCCA CCGGCGGAGA TGCCGGGAGC GGGAATGCCA TCGACAGAAG CACAGCAAGC
GACGGCTCCC CTGCGGCAAC CGACGACCCG ACGACCACGC AGCCCGCGCC CCCACCGGCG
ACGCCTTCAC CGACCCAGGC CGACGACATG AACCGCGCCG ACGACACGAA CCGCGCCGCC
CTCGCGCCCC CAACCGCGCC GACGCGCCCA CCTGCCGCGC CCTCCTCGCC GGCCGCGAAC
GCGCCGGCAG GAAATCCAGA CCGGCACGAC CCGACGTCAG CGCCGTCCCG GCGGCGGCGC
TGGCGAACCG GCGATCTCTG GATCCATATC GCCACCGTCA TCGCCGTCCT CGGCGTCGCC
GGCATCGCCG CCGTCGTCAG CTACCGCCAC ATGCGCGCCG TCGCCATCCT CCACGGCGAG
AACCCCGCCA ACGCCGCGAT CATCCCGCTG TCCGTTGACG GCCTCATCGT CGCCGCCTCC
ATGACCATGC TCGCCGACAG CCGCGCTCAC CGACACCGGT CCTGGCTCGC CTACAGCCTG
CTCACCCTTG CCTCCGCCGC CAGCCTCGCC GCCAACGTCA TGCACGCCGA ACCCACCCTC
GCCGCCCGCG TCATCGCCGC CTGGCCCAGC GCCGCCCTCA TCGGCGCCTA CGAACTCCTC
ACCGCCCAGA TCCGCGGCGC CGTCACCACC CAGACCCACC CCGCCGCCCC ACCCGCTCCA
GCCGCCGCGC CGACTTCCGC CCCCGCGCCC ACTCCAGCCG CCGCGCCCCC GGCGCCCGGC
GCCCCGCCGA GCCCCGAACC CACGACGATC ATCACCCCCG AGAAGAACGG GAACGATCCC
GGAACGAAAA CAGCATCCCA GCCGGTCACG GTCAGACCCG GCACGAAGAA GGCCAGGCTC
CAGAAGCTCC TCGAAGCCCT GCCCGCCAAC GACCCCCGGT CCGTCTACGC CCTCGCCAAA
GACCTCGCTC CCCTCATCGG CCTGAACGAA GGCACCGCCC GCCGCTACAT CCCCCACCTC
AGGTCATGA
 
Protein sequence
MILTAEKPPT APPPSTGTIT PLAGPPAPRP AAPPPTSPTR GNSTGGDAGS GNAIDRSTAS 
DGSPAATDDP TTTQPAPPPA TPSPTQADDM NRADDTNRAA LAPPTAPTRP PAAPSSPAAN
APAGNPDRHD PTSAPSRRRR WRTGDLWIHI ATVIAVLGVA GIAAVVSYRH MRAVAILHGE
NPANAAIIPL SVDGLIVAAS MTMLADSRAH RHRSWLAYSL LTLASAASLA ANVMHAEPTL
AARVIAAWPS AALIGAYELL TAQIRGAVTT QTHPAAPPAP AAAPTSAPAP TPAAAPPAPG
APPSPEPTTI ITPEKNGNDP GTKTASQPVT VRPGTKKARL QKLLEALPAN DPRSVYALAK
DLAPLIGLNE GTARRYIPHL RS
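
The protein sequence above corresponds to the gene sequence translated with translation table 11, which assigns codons to amino acids in the same way as the standard genetic code. The Python sketch below shows one way to check that correspondence and to compute a GC fraction for a sequence pasted from this page. It is a minimal illustration, not the method used to produce the record: the helper names `translate` and `gc_fraction` are hypothetical, whitespace is stripped because the sequence is printed in blocks of ten, and only the bases A, C, G, and T are handled.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 shares); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 9 bases of the gene sequence listed above.
first_bases = "ATGATCCTGA CCGCCGAGAA GCCCCCGACC"
last_bases = "AGGTCATGA"
print(translate(first_bases))  # MILTAEKPPT
print(translate(last_bases))   # RS
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content listed in the record.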