Gene Francci3_1641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1641 
Symbol 
ID3905920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1973361 
End bp1975127 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content73% 
IMG OID637878979 
Productextracellular solute-binding protein 
Protein accessionYP_480746 
Protein GI86740346 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0558371 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCGTGC TCGCCACGAT CCTGCTCGCC ACGGTCCTGG CTGCCACGGT GACCGCCTGC 
AGCGGTGGCT CGGGGGACGG ATCGCGTGAG CTCGCTGCCA GTCCCACCCC GGCGACGACC
ACCACGCCCC CTCCGTCCGG CAAGCCGGGC GGCACGCTGC GGATCGTCAC CCAGTGGATG
CCCAGTGGCG ATCCGGGCTG GGCCGACCAG CCCGGCGAGC GGGCGGTCAG CCGGCTCGTG
ACCCGCCAGC TGTACAGCTA TCCCTCCGAC GAGGACACGA CGAAGTCGAC CATCCCGCGG
CCGGACCTCG CCGTCGGCGC GCCGGTCGTC ACCGAGAACG GGCTGGTCTA CACGGTGCGG
CTGCGTCCCG CGGCGCGCTG GGACACCCCC AACCAGCGCC GGATCACCGC CAACGACGTC
GCCCGCGGCA TCAAGCGACT GTGCACGCCC CCGAACCCGT CACCGCTGCG CGGCTACTTC
ACGGCGACCA TCGTCGGATT CCGGGAGTTC TGTGCCCAGC TGGCGGCGAC CCCGGTCGCC
GACGCCGCGG CCTTCGTCGA GAGCAGCACC GTGGAGGGCA TCGAGATCGT CGGCGACGAC
ACGCTCGCGT TCCACCTACT GGCCCCGGTG AACGACTTCG TGGACGTGCT GGCGCTGCCG
GCGGCCTCCC CGGTGCCGCT GGAGGCCCTG GCCTACCCGC CGGACTCGCT GGAGTACCTG
CACAACCTGG TCTCGGCCGG GCCGTACCGG TTCACGGTCG CCCCCGGCGA GGGGTACCGG
CTGTCGCGCA GCCCATCGTG GAGCGCGTCC TCGGACGGGA TCCGCCGCGC CCTGCCCGAC
CATATAACGA TCTTCGACGG CCTGAGTCCG GAGGCCATGC AGCAGGAGCT GGAGAGCGGC
GACGCGGACA TGTCGCTCGA CGGGAAGATA CCCGACAGCC GGGCCGTGGA GCTCGCCAGG
GCGAACGACC CCCGTCTCGT GGTCGACGGG GTCGGGGTCA CCCTGGCGCT CACCGTCGGA
TTCAATGGGC CCTCCGCGGC CGCGCTGCGT GAGCTGTCGG TCCGCCGGGC GCTGCCCTAC
TGCATCGACC GGGTGAGCCT CGCGGCGGCG CTCGGCGGCC CCGAGTTCGC CGCTGCCGCC
ACCGGGTTGC TGCAGGAGAC GATGACCGGC TACACCGACG CGGATCCCTT CCCGAGCCCG
GCGGGACTCG GCGACGCGGC CCGTTGCCGG GAGGCGCTGA GCCATACCCC CGGCGGACCC
GTCACGGCGC TGTCCCTGCT CACCACGGAC AGCGCCACCG ACGTGGCGGC CGCCGAAGCA
CTGCGGACCG CCTTCGCCCG TTCCGGTATC CGCCTCGACA TCCGGATCCG GACCGGCGAG
CGCTACCGTG CGGCAGCCGT CCACCCGACC GGGCAGTTCT GGGACCTGGC GCTCACTACT
ATCGCCCCGG ACTGGTACGG CGACGCGGGC CGCACCGTCT TCCAGCCGTT GCTGGACGAG
ACGTGGGCCG GCCCCAGGCC GGCCGACGGC GGCTACCGGG ATCCGGGCGC CCTGCATCTG
CTCGCCACGG CGCTGCGGGC CACCAGCGAG GCGACCGCCG CGAGCAACTG GGCCGACCTG
GAGCACACGC TGGTCGAGCA GGTCGCGGTG ATCCCGCTGG CCGTCGTGCA CACCCCGCAG
TTCCACAGCA CGAACGTCAC CGCGTTCACG ATCGTGCCGT CGATCGGCAC CGCCGATCCG
ACCGCGGTCT CGCTGGGGTC CGGATGA
 
Protein sequence
MSVLATILLA TVLAATVTAC SGGSGDGSRE LAASPTPATT TTPPPSGKPG GTLRIVTQWM 
PSGDPGWADQ PGERAVSRLV TRQLYSYPSD EDTTKSTIPR PDLAVGAPVV TENGLVYTVR
LRPAARWDTP NQRRITANDV ARGIKRLCTP PNPSPLRGYF TATIVGFREF CAQLAATPVA
DAAAFVESST VEGIEIVGDD TLAFHLLAPV NDFVDVLALP AASPVPLEAL AYPPDSLEYL
HNLVSAGPYR FTVAPGEGYR LSRSPSWSAS SDGIRRALPD HITIFDGLSP EAMQQELESG
DADMSLDGKI PDSRAVELAR ANDPRLVVDG VGVTLALTVG FNGPSAAALR ELSVRRALPY
CIDRVSLAAA LGGPEFAAAA TGLLQETMTG YTDADPFPSP AGLGDAARCR EALSHTPGGP
VTALSLLTTD SATDVAAAEA LRTAFARSGI RLDIRIRTGE RYRAAAVHPT GQFWDLALTT
IAPDWYGDAG RTVFQPLLDE TWAGPRPADG GYRDPGALHL LATALRATSE ATAASNWADL
EHTLVEQVAV IPLAVVHTPQ FHSTNVTAFT IVPSIGTADP TAVSLGSG