Gene Francci3_1148 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1148 
Symbol 
ID3903576 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1364471 
End bp1366153 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content68% 
IMG OID637878480 
Productputative ABC transporter ATP-binding protein 
Protein accessionYP_480256 
Protein GI86739856 
COG category[R] General function prediction only 
COG ID[COG0488] ATPase components of ABC transporters with duplicated ATPase domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.493052 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCAGT ACGTCTTCCA GATGCGCAAA GCGCGCAAGG CCCATGGCGA CAAGGTCATC 
CTCGATGATG TGACCCTGTC GTTCCTCCCC GGAGCCAAGA TCGGGGTAGT CGGCCCGAAC
GGCGCGGGGA AGTCGTCCCT GCTCAAGATC ATGGCCGGCC TCGATCAGCC GAGTAACGGC
GAGGCGACCC TGAGCCCCGG CTACACGGTC GGCATGCTCG CCCAGGAACC CCCGCTGGAC
GAGACCAAGG ACGTCCGCGG CAACGTCGAG GACGGCGTGC GCGAGATCCG CCGGGTGCTC
GCCCGCTACG AGGAGATCAA CGAGAAGATG TCCGCGCCCG ACGCGGACTT CGACTCCCTC
CTCGCCGAGC AGGCCGAGCT TATCGACAAG ATCGAGGCCG CGAACGCCTG GGAGCTCGAC
AGCCAGCTCG ACCAGGCCAT GGACGCGCTG CGGCTGCCGC CCGGCGATGC CGACGTCACC
CTGCTCTCCG GCGGTGAGCG CCGCCGGGTC GCGCTGTGCA AGCTCCTGCT TGAGGCTCCC
GACCTACTCC TGCTCGACGA GCCGACCAAC CACCTCGACG CCGAGAGCGT CGCCTGGCTG
GAGCAGCACC TCGCCCGCTA TGCGGGCGCC GTGCTGGCCG TCACCCACGA CCGGTACTTC
CTGGACAACG TCGCCGGCTG GATCCTCGAG CTCGACCGGG GCCGTGCCTT GCCCTACGAG
GGCAACTACA CCACCTACCT GGAGAACAAG GCGGCCCGGC TGAAGGTCGA AGGCCAGAAG
GACGCCAAGC GGCGCCGGGT GCTCGCCCAG GAACTCGAGT GGGTCCGGTC CAACCCGAAG
GCCCGCCAGA CCAAGAGCAA GTCGCGTCTC GCCCGCTACG AGGAGCTGGC CGCCGAGGCG
GACCGGGCGC GCCCGCGCGA CTTCGAGGAC ATCCAGATCC CGCCCGGCCC CCGGCTCGGC
AACCAGGTCA TCGAGGCCAA GGGGCTCACC AAGGGCTTCG ATGACCGGCT TCTCATCGAC
AACCTGTCGT TCACCCTGCC GCGCGGCGGC ATCATCGGCG TGATCGGCCC CAACGGCATC
GGTAAGACGA CCCTGTTCAA GATGTTGACC GGCCAGGAGG CGCCGGACGC CGGAGAGCTC
GTCATCGGCG ACACCGTCGA CATCGCCTAT GTCGACCAGA CCCGCTCGGG CCTGGACCCG
AAGAAGAACG TCTGGCAGGT CGTCTCCGAC GGCCTCGACC ACATCGTCGT CGGCAAGGTC
GACTTCCCGA GCCGGGCGTA CGTGTCGTCA TTCGGGTTCA AGGGGCCGGA CCAGCAGAAG
CCCGTCGGTG TGCTGTCCGG TGGGGAGCGT AACCGGCTGA ACCTCGCGCT CACCCTCAAG
CGTGGCGGCA ACGTCCTGCT TCTCGACGAG CCCACCAACG ACCTCGACGT GGAGACGCTG
CGCTCCCTGG AGGACGCGCT GCTGGAGTTC GCCGGCTGCG CCGTGGTCAT CTCCCACGAC
CGCTGGTTCC TCGACCGGGT CGCCACCCAC ATCCTGGCCT GGGAAGGAAC CGACGAGGAC
CCGGCGCGCT GGTTCTGGTT CGAGGGGAAC TTCGCCGACT ACGAGACCAA CAAGATCGAC
CGTCTCGGGC AGGAGGCGGC CCGCCCGCAC CGCGTCACCC ACCGCAAGCT CACCCGGGAC
TGA
 
Protein sequence
MAQYVFQMRK ARKAHGDKVI LDDVTLSFLP GAKIGVVGPN GAGKSSLLKI MAGLDQPSNG 
EATLSPGYTV GMLAQEPPLD ETKDVRGNVE DGVREIRRVL ARYEEINEKM SAPDADFDSL
LAEQAELIDK IEAANAWELD SQLDQAMDAL RLPPGDADVT LLSGGERRRV ALCKLLLEAP
DLLLLDEPTN HLDAESVAWL EQHLARYAGA VLAVTHDRYF LDNVAGWILE LDRGRALPYE
GNYTTYLENK AARLKVEGQK DAKRRRVLAQ ELEWVRSNPK ARQTKSKSRL ARYEELAAEA
DRARPRDFED IQIPPGPRLG NQVIEAKGLT KGFDDRLLID NLSFTLPRGG IIGVIGPNGI
GKTTLFKMLT GQEAPDAGEL VIGDTVDIAY VDQTRSGLDP KKNVWQVVSD GLDHIVVGKV
DFPSRAYVSS FGFKGPDQQK PVGVLSGGER NRLNLALTLK RGGNVLLLDE PTNDLDVETL
RSLEDALLEF AGCAVVISHD RWFLDRVATH ILAWEGTDED PARWFWFEGN FADYETNKID
RLGQEAARPH RVTHRKLTRD