Gene Francci3_1571 details


Gene Information

Locus tag: Francci3_1571
Symbol:
ID: 3904803
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1885120
End bp: 1886310
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 72%
IMG OID: 637878908
Product: hypothetical protein
Protein accession: YP_480676
Protein GI: 86740276
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
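
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both are assumptions that happen to agree with the listed numbers; the record does not state them. The function names are illustrative only.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
START_BP = 1885120
END_BP = 1886310


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the terminal stop codon


length = gene_length(START_BP, END_BP)
print(length)                  # 1191
print(protein_length(length))  # 396
```

Both results match the Gene Length and Protein Length fields above.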


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.443604
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGACG CCCCCACGGC GGATCAGCGG GACGTGGTGC TGGCCCTGCC GATCGAGACG 
CTCCAGGACA TGGCGATCAG GGCGTACATG CGTCCTCCCG ATCGTCTCCT GCTAACCCTG
CTGCGATCGC CCCGGGTCCG GCGCGTGGTG GTGGCCGAGC CGTTCCGGAG CCATCTCGGT
ACGGTGCTGC GGGGCGGCCG GAGCACCGTC CTGCCACCGT CCGGTGGCGT TGAGGGGCAT
CTGGTGTCGC CGCAGCGGTG GCGCCGCAAG GACCCGGTGA CGCTGCCGTC GCTGCGGGCG
GCCTATCGGC GGTATGACAG GAGGCTCGGC CGGGCCGCGG CCCGAGCCGG CTGTAAACGA
CCCGTCGTGA TCACCATGTA TCCGCCGCTG GCGGGCTTTG CCGACATGTC CTGGGCCGGC
TCGGTCATGT ACTACGCCCG GGACGACTGG GCGACCTACC CACCGCTACG GCGGTGGCAT
CCGGCCTTCC GGCATGCCTA CGAGGAGATC CGGCGCCGGC GGCTGCCGGT CATCGCGGTG
TCCAGGCCGC TGCTGGAACG CCTCCATCCC ACCGGTGCCG GGCTGGTCGT GCACAACGGT
GTCGATCCGG CCGAGTGGCT GCGCCCGCCG TCCCCGCCGG ACTGGCTCCG GCGCCTCCCG
CGGCCGTGGT GCGTGTATGC GGGCACCGTC GACACTCGCC TCGATCTGGA CATGATTCGC
CGCCTGGCGT CGGCCGGCAC CGTGATTCTG GCCGGCCCCA TCCCGGACGA GGCCTCCGTC
CGGTCGCTGC GGTTGCTGCG GTCGGTGCGG TTGCCTGGGC ATCTGCCCCG GCCGGCCGTG
ACCGGTCTGA TCGCCGCGGC CGACGTGTGC CTGCTCACCC ACCGGAGCAC TCCGTTGACC
GAGGCCATGG ACCCCATCAA GATCTACGAA TATCTGGCGG CCGGGCGTCC CGTCCTCGCC
ACGGACCTCG CCCCGGTTCG GGGCATCGGG CGGCGGGTCC GGCTGCTGCG CCCGGGGGAC
GATCCGGTGG CGGCGATGAA CGAGGTCCTG ATCTGGCCGG CCGTCACGGA GGCTGACCGG
CTGGATTTCG TCGCGGACAA CAGCTGGTCC GCTCGGCACG TCGCCTTCCT GGACTTCGTC
CTTGGCCCGG CCGCGCCGGC CGAATCCCGT CTTCTGGCGG TGCAGGCGTA G
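
The GC content field can be recomputed from a sequence laid out in blocks of ten bases like the one above. This is a minimal sketch: the helper names are hypothetical, and it assumes the sequence uses only the letters A, C, G and T. The example input is just the first line of the gene sequence, so its value is not expected to equal the 72% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGAAAGACG CCCCCACGGC GGATCAGCGG GACGTGGTGC TGGCCCTGCC GATCGAGACG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.1%}")
```

Passing the full gene sequence through the same two functions gives a figure that can be compared with the GC content field.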
 
Protein sequence
MKDAPTADQR DVVLALPIET LQDMAIRAYM RPPDRLLLTL LRSPRVRRVV VAEPFRSHLG 
TVLRGGRSTV LPPSGGVEGH LVSPQRWRRK DPVTLPSLRA AYRRYDRRLG RAAARAGCKR
PVVITMYPPL AGFADMSWAG SVMYYARDDW ATYPPLRRWH PAFRHAYEEI RRRRLPVIAV
SRPLLERLHP TGAGLVVHNG VDPAEWLRPP SPPDWLRRLP RPWCVYAGTV DTRLDLDMIR
RLASAGTVIL AGPIPDEASV RSLRLLRSVR LPGHLPRPAV TGLIAAADVC LLTHRSTPLT
EAMDPIKIYE YLAAGRPVLA TDLAPVRGIG RRVRLLRPGD DPVAAMNEVL IWPAVTEADR
LDFVADNSWS ARHVAFLDFV LGPAAPAESR LLAVQA
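
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds the standard codon table by hand rather than relying on a library. It assumes that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted, which does not matter here because this gene starts with ATG. The `translate` function is illustrative and marks unrecognised codons with X.

```python
from itertools import product

# Standard codon table, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate("ATGAAAGACG CCCCCACGGC GGATCAGCGG"))  # MKDAPTADQR
```

The output matches the first ten residues of the protein sequence listed above; translating the full gene sequence allows the same comparison over all 396 residues.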