Gene Francci3_0724 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0724 
Symbol 
ID3903514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp833482 
End bp834438 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content67% 
IMG OID637878057 
Producthypothetical protein 
Protein accessionYP_479837 
Protein GI86739437 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1682] ABC-type polysaccharide/polyol phosphate export systems, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.432005 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.438264 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCACATC TGGCTGACGC CGCGCTGCCG CCGGTGGCTC CGTCCGCCCC GGAGCCGCAG 
ACCGGTCCGC CAGGGCCGCC CGGGCCGGAG CACCATCACG CGCTGGACGT GAGCCGGCTG
GCCCGTCGCA GCCGGGCCCA GACGAGGGTG CGCTGGGAAC TGCTGCGCAA TCTCATCCGC
AAGGATCTCA AGGTCAAGTA CAAGGGGTCG ACGCTGGGAT TCGCCTGGTC ACTGGCGAAC
CCGATGCTCC TTCTCGTCGT GTATACCTTC GTGTTCCAGA TCGTGCTGAA ATCAGGAATC
CCCCGCTTCG GTATTTACCT GATGTCCGGG CTGCTCGTCT GGAACGCCTT CTCGGGCAGC
GTGTCCGCGT CCTGCGGCAG CGTCGTGGCG AACGCGAACC TCGTCAAGAA GGTCCGCTTT
CCCCTCGCCG TGCTCCCCCT GTCGGCGGTC GGCTTCGCCG CGGTGCACTT CCTGCTGCAA
CTGCTGGTCC TGTTCGTGGT GATCCTCGCG CTCGGCTACA GCCTGCTCGG CCCCGAGCTG
CTGCTGCTCG TCCCGGCCGG AGCCGTCGCC CTCACCTTCA CCGTGGCACT GTCCCTGCTG
GTCAGCGCGC TCAACGTGCG TTACCGCGAC ACCGCCCATC TGCTCGACGT GGCGCTGCTC
GCCTGGTTCT GGCTGAATCC CATGGTGTAC GCCTTCGGCC TCATCCAGAA CAGGTTGGCC
GACCTGACCT GGGTGTATCT GCTCAACCCG ATGGCCGTCG TGGTCATCAC GTTCCAGCGG
GCGATCTACG ATCCACCCCC GGGTAACTCG ACGGGCTCCG CCCCCATCCT GGCGAACCCC
GGTTACACGT TCTACCTGGA ACATCTCGCC GTCGCCGGGG TGATCTCCCT CGCCCTGCTG
TGGCTGGGGC TGCACGTGTT CCGCGCCCTG CAGGCGGACT TCGCCGAGGA TCTGTAG
 
Protein sequence
MSHLADAALP PVAPSAPEPQ TGPPGPPGPE HHHALDVSRL ARRSRAQTRV RWELLRNLIR 
KDLKVKYKGS TLGFAWSLAN PMLLLVVYTF VFQIVLKSGI PRFGIYLMSG LLVWNAFSGS
VSASCGSVVA NANLVKKVRF PLAVLPLSAV GFAAVHFLLQ LLVLFVVILA LGYSLLGPEL
LLLVPAGAVA LTFTVALSLL VSALNVRYRD TAHLLDVALL AWFWLNPMVY AFGLIQNRLA
DLTWVYLLNP MAVVVITFQR AIYDPPPGNS TGSAPILANP GYTFYLEHLA VAGVISLALL
WLGLHVFRAL QADFAEDL