Gene Francci3_3908 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3908 
Symbol 
ID3906676 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4677029 
End bp4678180 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content74% 
IMG OID637881234 
Producthypothetical protein 
Protein accessionYP_482987 
Protein GI86742587 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.409846 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGTCAC GGAGTGCAGA CCAGGAGTCC GAGCGACGGG CGGACGCGGC ACGCCTCCGC 
CCACTCCCGC CGGCGGATGT ACCCGGCGCC ACCCGGGCCC GCCCACGCCC GCCGGCGCGA
TCGAGGCTCA CCGTCGCGAT CCTGCTGACG GCGGGCGTCG CGCTGGCACC CGCGGCCCTT
CCGGCCAGCC CCGCCCAGGC GGCGTCGGAC ACCGGGCGAG GGGGCGCGCC GATCGCCGCC
ACGTACCCGG GATCCACCGC GGGACAGCTC CTCTCGGCGC CGGGGGCGGT GACCCTCGCC
GAGCAGCGCG AGGTCGAACG GTACTGGAGC GCGCAGCGGC GGGCCCGAGC CTTCGGGGCC
CGCACGAACG ACCATGCCGA TGCCGCCCCC CTCAGTCCCG GGCCGGCCCC GCCTGCCGCC
GGCGACTCCG AAACGGCCGC GAGCGACGCT CCCCCGCCCC CGAGCGCCGG AACGCCCTAC
ACCCGGGGCG GGCTGGTGAC CACGACCACC GGCCGGTTGT TCGCCACCAT CGGCGGCAGT
GACTACGCCT GCTCCGCGAG CGTCGTGTCC AGCCCCAGCC GTGACCTCGT GGTCACCGCG
GGGCACTGCG TGCACGGCGG TAGCGGGGAA CAGTTCGCGC GGAACGTCAT CTTCATCCCC
GGGTTCGACA ACGGCGCGAT GCCCTACGGC ATCTGGACCG CCCGCCGGCT TACCGTGACG
TCCGGGTGGG CCCGCCAGGA CGACTTCGAC GTCGACACCG GCTTCGCCCT GTTCAACCCG
TCCCCGTCCC GCGGCCGTCA TCTCGAGGAC ACGGTCGGAT CCCAGGGCAT AGCCTTCGAT
CTTCCGGGGA CGTATCCGCA GTACACCTTC GGCTACCCGC GGCTGCCCCC CTACGACGGC
AGCCAACTCG TCTACTGCGC CGGCCCCGGC TTCGGCGACC CCTACGGCAG CCCGTCGATC
GGCGTCGCCT GCCGCATGAC GGCCGGCGCC AGCGGCGGGC CACTACTCAC CGGCCTCGGC
CGGCTCGGTT CCGGGCACGG CTGGGTCGAC GGGGTGGTCA GCTACGCCTA TGCCGGGGTC
AACGACAAGC TGTACGGAAC TCACTTCGGC GACGTGGTGA AGTCGCTCTA CTACCAGTCC
TACCGGCTCT AG
 
Protein sequence
MTSRSADQES ERRADAARLR PLPPADVPGA TRARPRPPAR SRLTVAILLT AGVALAPAAL 
PASPAQAASD TGRGGAPIAA TYPGSTAGQL LSAPGAVTLA EQREVERYWS AQRRARAFGA
RTNDHADAAP LSPGPAPPAA GDSETAASDA PPPPSAGTPY TRGGLVTTTT GRLFATIGGS
DYACSASVVS SPSRDLVVTA GHCVHGGSGE QFARNVIFIP GFDNGAMPYG IWTARRLTVT
SGWARQDDFD VDTGFALFNP SPSRGRHLED TVGSQGIAFD LPGTYPQYTF GYPRLPPYDG
SQLVYCAGPG FGDPYGSPSI GVACRMTAGA SGGPLLTGLG RLGSGHGWVD GVVSYAYAGV
NDKLYGTHFG DVVKSLYYQS YRL