Gene Francci3_2834 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2834 
Symbol 
ID3904746 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3338094 
End bp3339473 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content67% 
IMG OID637880155 
Productextracellular solute-binding protein 
Protein accessionYP_481921 
Protein GI86741521 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.873121 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTCGAC CACGCCTGCT GCGTGCCGCG GCTGGCCTCG CGTTGCTTGC GGCGCTCGCC 
TCCGCGTGTG CGACGTCCTC GTCCACACCC ACGAGTACGG CCGCGAACGC CAGCACGATC
CCCGAACTCT CCCCGGACCA GAAGGTCTCG ATCGTCTTCG AAAGTTACAA CCTCGCGAAC
GTCGGGACGT GGAAGCCGGT GATCGAGGGG CTGCTGCGCG ACTTCCAGGC CGCGCATCCC
AACATCACCG TCAAGGGCCA GCCGCCGCAG AACCTGGCCG GTAGCGCGAA CAGCGGCGAC
TACGTCACCA GCATCAAGAA CCAGGTGCTG GCAGGCAGCC CCCCGGACGT CGCCCAGATC
ACCTTCAACG CGCTGCGCTT CGCCGCCGGC AGCCTCGGCG CCCAGCCGCT CGACACGTTG
GTCGGGCGGG AGGCCGTTCA GGCGAACTTC GGCGGCGAAC ACCCGTTCGC CCCGAAAGCC
CGCACCCTCG GCGACGTTGA CGGCAAGACC TACGCCGTCC CGTACGTCTT CTCCACCCCG
GTTCTCTGGC TGAACAAGAC CCTGTTCACC CAGGCCGGCC TCGACCCGGC CAAGCCGCCG
AAGACCTGGG CGGAGGTAAA GACCGCGGCG CTTGCCATCA AGGCGAAGAC CGGCAAGGAC
GGGGTCCTCA TCGACTGTCT GACGAAGGTC GGGGACTGGT GTTTCCAGAG CCTGGTACGT
TCGGCCGGCG GCCGAGTGAT CTCCACCGAC GGTACCAAGC TGTCCTTCGC CGATCCGCCG
GGCGTCGAGG CGGTATCGAT GGCCGCCGAC CTGGTACACA GCGGCGTGAT GCCGAACCTT
GACCAGAAGC AGCAGGTCAA GGCGTTCAGC AGCGGTCAGG CCGGGATGCT GCTCGAAAGC
AGCTCCCTGC AAGGGATGTT CATGGCCGGC GCCAAGGCGA ACAACTGGCA GCTCGACGCC
ACCCAGGAAC CCTCGTTCGG CTCGAAGCCG GTCATCCCCA CCAACTCGGG AGCGGCACTG
GCGATCTTCA GTAAGGACCC GGCCAAGCAG CGCGCCGCCT GGGAACTGAT CAAGTTTCTC
ACCGGCGACC ACGCCTACAC CGAGATCTCC TCGAAGATCG GCTACCTGCC GCTGCGGACC
GGCCTGATCG ACGACCCGAA GAGCCTGCAG GCCTGGGCGA AGGCGAACCC GCTCATCAAG
CCGAACCTCG ACCAGCTCGC CCGGATCGAG CCGTGGGAGT CCTTCCCGGG TGACAACTAC
CTCCAGATCT CGGACACGAT GATGACCGCG GTCGAAAGCG CCGTCTTCAC CGGCAAGGAT
CCCGCATCCA CCCTGGCCGC CGCACAGAAG CAGGCCACCA GCTTCCTGCC CCGGAAGTGA
 
Protein sequence
MARPRLLRAA AGLALLAALA SACATSSSTP TSTAANASTI PELSPDQKVS IVFESYNLAN 
VGTWKPVIEG LLRDFQAAHP NITVKGQPPQ NLAGSANSGD YVTSIKNQVL AGSPPDVAQI
TFNALRFAAG SLGAQPLDTL VGREAVQANF GGEHPFAPKA RTLGDVDGKT YAVPYVFSTP
VLWLNKTLFT QAGLDPAKPP KTWAEVKTAA LAIKAKTGKD GVLIDCLTKV GDWCFQSLVR
SAGGRVISTD GTKLSFADPP GVEAVSMAAD LVHSGVMPNL DQKQQVKAFS SGQAGMLLES
SSLQGMFMAG AKANNWQLDA TQEPSFGSKP VIPTNSGAAL AIFSKDPAKQ RAAWELIKFL
TGDHAYTEIS SKIGYLPLRT GLIDDPKSLQ AWAKANPLIK PNLDQLARIE PWESFPGDNY
LQISDTMMTA VESAVFTGKD PASTLAAAQK QATSFLPRK