Gene Francci3_2300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2300 
Symbol 
ID3904834 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2682410 
End bp2683447 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content76% 
IMG OID637879631 
Productpentapeptide repeat-containing protein 
Protein accessionYP_481397 
Protein GI86740997 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000752712 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0184634 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCCGACG ATCTCGATCC CGCCGCGGGA ATCACCGGGG ACGCCCGCGG CCGGCTGCGT 
GCCGACTGCG GGCGCTGTTT CGCGCTGTGC TGCGTGGCAC CGGCGTTCTC CGCCTCGGTG
GACTTCGCGA TTGACAAACC CGCCGGCACG GCCTGCCCCA ACCTGAACGC GGGCTTCCGC
TGTGCCGTCC ACGAGCAGCT GCGGCCGCGG GGGTTCCGCG GCTGCGCCGT GTATGACTGC
TTCGGCGCCG GGCAGCAGGT CTGCCAGGTC ACCTACGGCG GGCGGGACTG GCGAGCGGCC
CCGGACAGTG CCCGGCAGAT GTTCGATGTC TTCGCGGTCA TGCGGACGCT GCACGAGCTG
CTGTGGTATC TGAGCGAGGC GCTGAGCCGG CCCGCCGCCG CCCCGCTGCA CGCCGAGCTG
CATGACGCCC GCGCTCGCAC CATGCTGCTG ACCTGCGCCG ACGCCGACAC GCTGCTCGCC
GTGGACGTCC CCGCGCTGCG TCAGGCGGTC CATCTGCTGT TGCTGCAGGC CAGCGACCTG
GTCCGCGCCG AGCTGCCGGG CGGTGCCGGG AACAGGCGTC CCCGGCGGCG CCCCCGCGGC
GCCGACCTGC TCGGCGCGAA CCTCGCCGGG GCGGATCTGC GCGGCGCGGC GCTGCAGGGA
ACCCTCCTGA TCGGCGCGGA TCTGCGGGGG GCGGATCTGC GCGGCGCCGA CCTGCTCGGC
GCTGACCTGC GCGACACCGA CCTGCACGGC GCCAACCTGC ACGGCGCCCT GTTCGTCGTT
CAGGCCCAGC TCGACGCCGC CCGCGGGGAC GCCACCACGG CGCTGCCCCC GGCTCTGACC
CGCCCCGTGC ACTGGACGCG CACGGCGGAC CCGACGCGGC CGGACCCCAG AACGCACCCC
GTGGCGGGCT CGGGCCCCGG GCCCGGGACG GTGCCGATCC CCTCGTCGTC GCCCAGCGCG
GCCCGGCCGC GGCGGTCGGC GAGTTCTCCT CGGTCGCCTC GTTCCCCTCG GTCGCCTCGT
TCCCCACGCC GCCGCTGA
 
Protein sequence
MPDDLDPAAG ITGDARGRLR ADCGRCFALC CVAPAFSASV DFAIDKPAGT ACPNLNAGFR 
CAVHEQLRPR GFRGCAVYDC FGAGQQVCQV TYGGRDWRAA PDSARQMFDV FAVMRTLHEL
LWYLSEALSR PAAAPLHAEL HDARARTMLL TCADADTLLA VDVPALRQAV HLLLLQASDL
VRAELPGGAG NRRPRRRPRG ADLLGANLAG ADLRGAALQG TLLIGADLRG ADLRGADLLG
ADLRDTDLHG ANLHGALFVV QAQLDAARGD ATTALPPALT RPVHWTRTAD PTRPDPRTHP
VAGSGPGPGT VPIPSSSPSA ARPRRSASSP RSPRSPRSPR SPRRR