Gene Francci3_2289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2289 
Symbol 
ID3904823 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2669805 
End bp2671262 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content74% 
IMG OID637879620 
Productmajor facilitator transporter 
Protein accessionYP_481386 
Protein GI86740986 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00392793 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000728467 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGACCCGG GCGTGCGTAC CTCCGCATCC CGCAACACCC TGATGTCGCT GCTGGCCCAC 
GAGCGGGCCC GCCCTCGCGC GGTCCGCGAC CTGCCGGGCG GCTGGTGGCT GGCGGTCGCC
ACCGTGTGCT TCGGCGCGTT CATGGGCCAA CTCGACGCCA GCATCGTCAC CCTGGCCTAC
GGGCCGCTGT GCGCACAGTT CCACGCCCCG CTCGCTGCAG TCACCTGGGT CTCCCTGGCC
TACCTGCTCA CCCTGGTCGC CCTGCTGGTG CCCGTCGGCC GGCTCGCCGA CGCCCACGGC
CGCAAACAGT TCTACCTCTA CGGACTCCTC GTCTTCACCG CCACCTCGGC CGCCTGTGGC
CTGGCTCCCA GCCTCGCCGC TTTGATCGGT TCCCGCGTCG CGCAGGCCGT CGGCGCCGCG
ATGCTGCAGG CCAACAGCGT CGCCCTTGTC GCCACCAGCG CACCCCGCCC GAGGATGCGC
GCCGCCCTGG GCGTCCAGGC CGCCGCCCAG GCCCTCGGCC TCGCTCTCGG CCCCACCCTC
GGCGGAGCCC TGGTCACCAC CCTCGGCTGG CGCTGGGTGT TCGCCATCAA CGTCCCCGTC
GGGACAATCG CCCTCATCGC CGGCTACTAC CTGCTACCCC GCACCCGCCA GCGCACCGAC
CCTGCCCCCT TCGACTGGCC GGGCCTGGCC CTCCTCGCCA CCGCCACCAC CACCCTGCTG
CTGGCCATCT CGGCCGTCTC CGGCCTGAAC CTGCCCTCCG CGGCCACCGC CATCCTCGCC
ATCCTCGCCC CGCTCGCCGG CTACGGCCTC GTCCAACGGG AACGCCGCGC GCCGGCACCC
CTGATCGACC TGCGCCTCCT TCGCATCCCT GCCCTCGCGG GGGGACTCGT CGGCGCGCTG
TGCGGCTACC TCGTCCTGTT CGGCCCGCTC GTCCTGGTCC CCGTCGTCCT CACCGACCGC
GGCACCTCCC CCCTGCACGC CGGGCTGGTC CTCACCGCCC TGCCCGGAGG CTTCGCGCTC
GCCGCCAGCG GCGCCGGGGC GGTCCTACCC GACCGGTGGA GCGACCGCCG CCGCTACGCG
CTCGGCGCCG TCACCTGCAC GATGGCACTC GCCGCCGCAC TCGCCGTGCC ACTGTCCGCA
CCCTGGCTGA TCCCCCCGAT GGCCATGCTC GGCCTCGGCC TCGGCGTCTA CACCCCCACC
AACAACACCA CGATCATGAG TGCGATCCCG GCCCACGCGT CGGGTACCGG CGGCGGACTG
GTCAACATGA CCCGCGGTCT GGGCACCGCC CTCGGCGTGG CCATGGTCAC CCTCGCCGTC
CACCTCGCCG CCGGCGCCAC CGGACCCCGC CTGGCAATCG TCGGCCTCAC CGCGGCATCC
CTCCTCCTGT TCACCCCCCT CCTCACCCCT AGCCGGGGCA TGAGGAACAT CGGAACGGAA
AACAGCGCCA GGTGCTGA
 
Protein sequence
MDPGVRTSAS RNTLMSLLAH ERARPRAVRD LPGGWWLAVA TVCFGAFMGQ LDASIVTLAY 
GPLCAQFHAP LAAVTWVSLA YLLTLVALLV PVGRLADAHG RKQFYLYGLL VFTATSAACG
LAPSLAALIG SRVAQAVGAA MLQANSVALV ATSAPRPRMR AALGVQAAAQ ALGLALGPTL
GGALVTTLGW RWVFAINVPV GTIALIAGYY LLPRTRQRTD PAPFDWPGLA LLATATTTLL
LAISAVSGLN LPSAATAILA ILAPLAGYGL VQRERRAPAP LIDLRLLRIP ALAGGLVGAL
CGYLVLFGPL VLVPVVLTDR GTSPLHAGLV LTALPGGFAL AASGAGAVLP DRWSDRRRYA
LGAVTCTMAL AAALAVPLSA PWLIPPMAML GLGLGVYTPT NNTTIMSAIP AHASGTGGGL
VNMTRGLGTA LGVAMVTLAV HLAAGATGPR LAIVGLTAAS LLLFTPLLTP SRGMRNIGTE
NSARC