Gene Francci3_3100 details


Gene Information

Locus tag: Francci3_3100
Symbol:
ID: 3904226
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3671691
End bp: 3673001
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 75%
IMG OID: 637880421
Product: hypothetical protein
Protein accession: YP_482186
Protein GI: 86741786
COG category:
COG ID:
TIGRFAM ID:
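
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record; it assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon excluded from the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3671691
end_bp = 3673001
gene_length_bp = 1311
protein_length_aa = 436

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
total_codons = span_bp // 3
coding_codons = total_codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions all three checks agree with the record: 1311 bp is 437 codons, and 437 minus one stop codon gives 436 amino acids.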


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0288555
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.686964
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTGAGC CGCTTCCGGC GCCGGTGCGT GCGCATGTCG TCGAGTTCGC CGAGCGGACG 
CTGGCGGATC TTCCCGAGTC GCAGGTCCCG CCGTCGCTGG TCGCGGTGCG CCGCTTCAAG
CCCTCCCGCC GTGTCCGCCA GGGGGCCGTC CCGCTGGCCG CGGCGGTGGA CGGTGACGTG
TTCCGCGGCC GGGTCGCCGA GTGGATCCAC CGTCATCATC CCGAGCTTGT CGAAGCCGTC
AGCTCGCCGG ACGGACCGCC GCCCGCGGCG CCCCCCGAGA AGATCGCCGC GGTCGCGTAC
CTGCTGCGGG TCTCCGGCTG GCAGGAGCTG GTCCAGGTCG CCGCGGCGTC CACGTCGGAG
GCGGCGGCCC GCAGCCAGGT TGACGAGGCC GGGCGGACGA TACTGCGGCT CACCGAACAA
CTCGAGACGA GCAAGCGCAT CGCCGCGGCC GAGCAGGAGG AACTGCGCGA GCAGCTGCAG
GCGGCCCGGG CGGAGGCGGA CGAGGCTCGC CGGCGGCTGC GCTCGTCCGC TGCCGGGATC
CGGCAGGCCG AGCAGGCCAC GCGGGAGGCC CTGACCGCGG CCGAGGCGGC CCGGAACGCC
GCCCTGGCCG CCAGTCGGGA TGCCGAGGCG GAGACCCGGC GGCTGCGCGG CCGGGTGGCG
GAGCTGGAGA GCGCGCTCGC CTCCACCCGC CGGGACAGCC GCGAGTCCCG CAGCGTCGAT
GACGCCCGGT TGCGCGTTCT ACTTGACACC CTGATCGCCT CGGCGCACGG CCTGCGACGC
GAACTCGACC TCCCGACGAT GGTCGCGAGG CCCGCGGATC TCATCGCTCG CGGCGGAGCC
GGTCCGAGTA ACGGCCCGCA GGCGTTCGTC GGGGCCCGCG GGCGGCCGGA CGACGACCCC
TCCCTGATCG ACGAGGTGTT GGCCGTCCCC GGGGTCCATC TGATCATCGA CGGGTACAAC
GTGACCAAGC GGGGCTACGG CCGCCTCACC CTGCAGGCCC AGCGTGAGCG GCTGCTGTCC
GGACTCGGTG CGCTCGCGGG CCGCAATCCC GACAGCGAGG TCACGGTCGT CTTCGACGCC
ACCGCCGTCG TGGCCCGGCC GGTGGGTGTC GCCATGCCTC GGGGGGTGCG CGTGCTGTTC
AGCCGGCCCG GGCAGCTCGC TGACGAGGAG ATCGTCCGGC TGGCGCGGAT GGAACCGGAA
GGCCGCCCGG TCTTCGTCAT CACCTCCGAC CGGGAGGTCG CGGAGAACTG CGTCGCAGCC
GGAGCCCGGG CGGTGCCCTC GGCCGCTCTG CTGGCCCGCC TCGACCGGTA G
 
Protein sequence
MREPLPAPVR AHVVEFAERT LADLPESQVP PSLVAVRRFK PSRRVRQGAV PLAAAVDGDV 
FRGRVAEWIH RHHPELVEAV SSPDGPPPAA PPEKIAAVAY LLRVSGWQEL VQVAAASTSE
AAARSQVDEA GRTILRLTEQ LETSKRIAAA EQEELREQLQ AARAEADEAR RRLRSSAAGI
RQAEQATREA LTAAEAARNA ALAASRDAEA ETRRLRGRVA ELESALASTR RDSRESRSVD
DARLRVLLDT LIASAHGLRR ELDLPTMVAR PADLIARGGA GPSNGPQAFV GARGRPDDDP
SLIDEVLAVP GVHLIIDGYN VTKRGYGRLT LQAQRERLLS GLGALAGRNP DSEVTVVFDA
TAVVARPVGV AMPRGVRVLF SRPGQLADEE IVRLARMEPE GRPVFVITSD REVAENCVAA
GARAVPSAAL LARLDR
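
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is an assumption-laden example rather than the pipeline that produced the record: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is built by hand. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as start codons, which does not matter here because the gene begins with ATG. Only the first 30 bases of the gene sequence are used, to keep the example short.

```python
# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
fragment = "ATGCGTGAGC CGCTTCCGGC GCCGGTGCGT"
print(translate(fragment))  # MREPLPAPVR, the first ten residues of the protein
print(round(gc_fraction(fragment), 3))
```

Passing the full gene sequence to the same functions should reproduce the protein sequence and a GC content close to the reported 75%, although that has not been verified here.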