Gene Francci3_1324 details


Gene Information

Locus tag: Francci3_1324
Symbol:
ID: 3906596
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1586224
End bp: 1588230
Gene Length: 2007 bp
Protein Length: 668 aa
Translation table: 11
GC content: 69%
IMG OID: 637878657
Product: hypothetical protein
Protein accession: YP_480430
Protein GI: 86740030
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0531] Amino acid transporters
TIGRFAM ID:
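
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only and do not come from the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1586224
end_bp = 1588230

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 2007
print(protein_length)  # 668
```

Under these assumptions the computed values agree with the listed Gene Length (2007 bp) and Protein Length (668 aa).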


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.112475
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.347698
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCGCTCA CGGACCGCCT CAAGCGCGGT TTCGTCGGTC GACCGATCGG GAGCGACCGG 
CTTGGGGAGA CATTGCTCCC TAAACGCATC GCTTTGCCCG TCTTCGCGAG CGATGCCCTT
TCCTCGGTGT CGTACGCGAC CGAGGAGATC CTGCTTGTCC TGTCGCTGGG TGGGCTGGCC
TTCTACCACA TCTCGCCCTG GCTTGCTGGA GCCGTCGCCA TCCTGATGCT GACGGTGGTG
GCGTCCTACC GGCAGAACGT GCACGCCTAC CCGAGCGGCG GCGGCGACTA CGAGGTCGTC
TCGGTCAACC TGGGCCCGCG GGCCGGCCTG CTGGTGGCCA GTTCCCTGCT GGTTGACTAC
GTCCTCACGG TCGCGGTGTC GGTGTCGGCC GGTGTCGCGA ACCTGACGTC CGCTATCACC
GGGCTCGCCG CGCACAAGGT GCTCCTCGCG GTCGTCATCG TCGTCCTACT GACGATAATG
AACCTGCGTG GGGTGCGCGA GTCCGGCACG GCGTTCGCCA TCCCGACCTA CGGCTTCGTG
CTCGGCATCT TCGTAATGAT CGTCACCGGG CTGGTCCAGG CGGCCGTCGG TCATCCGCCG
CGGGCCGAGA GTGCTGGGTA CCAGGTGGTC GCCGAGCGTG ACTACGCGGG ATTCGCCCTG
GTGTTCCTGG TGTTGCGGGC GTTCGCCTCG GGATGCACGG CGCTCACCGG GGTCGAGGCG
ATCAGTAACG GGGTGCCCGC GTTCCGGAAG CCCAAGAGCC GCAACGCCGC GACCACCCTG
CTGATGCTGG GGCTTATCGC AGTCACGATG TTCGGCGGCG TCACCGCGCT GGCTCTGATC
TCCGACGTGC ATGTCGCTGA GCACACCGGG GACCTGATCG GCGCCAGTGG TGAGCAGCGG
ACCGTGATCG CCCAGGTCGC CGCCGCCGTC TTCGGGGACG GATCGCCCGG CTTCGGCTAC
ATCGCCGTCG TCACCGCTCT CATCCTGATG CTGGCGGCGA ACACCGCCTT CAACGGCTTC
CCGGTGCTGG GTTCGATCCT CGCCCGCGAC GGCTACCTGC CGCGTCAGCT ATACACCCGC
GGTGACCGAC TGGCCTACTC CAACGGGATC GTGCTGCTCG CCGGCTTCGC GATCCTGCTG
ATCGTCGTGT TTGACGCGCA GGTCACCGCC CTGATCCAGC TCTACATCCT CGGGGTCTTC
ATCTCGTTCA CCCTCAGCCA GACCGGGATG GTGCGGCACT GGGCACGCAT CCTGCGCTCG
GACGACCCGG CGGCGTCCGA TCCGGCCGCC CGCCGGCGGA TCCGCCGGTC TCAGGCGATC
AACTTCTTCG GCGCCTGCCT CACCGGCACC GTGCTGATCC TCGTGCTGGT CACGAAGTTC
ACCCACGGTG CCTGGATCGT CTGTCTGGCT ATCCCGATCA TCTTTCTCGG GATGCGGGGG
ATCAGGGCTC ACTACGACCG GGTCGCGGTC GAACTCACCC CGGAACCCGG GCCGCCGACC
CTGCCCTCCC GGATCCACGC CGTCGTCCTC GTCAGTAAGA TCCATGCCCC GACACTGCGG
GCCCTGGCCT ACGCCAAGGC GTCGCGGCCG CACAGCCTCG TCGCGGTCAC GGTCGCCGTC
GACCAGGCGG AGGCCGATCG GCTCCGCAAG GCGTGGACGG AGCGGGGGAT CACCGTCGAT
CTCGTGGTGC TGGCCTCCCC CTACCGGGAG GTGACCCGCC CGGTGCTGGA CTACGTCGCC
CGGATCAGGC GGGAGAGCCC CCGCGACGTC GTCGCCGTCT ACGTCCCGGA GTACGTCGTC
GGTCACTGGT GGGAGCATCT CCTGCACAAC CAGAGCGCGC TGCGGCTCAA GGCCCGGCTG
CTCTTCCAGC CCAGCGTCAT GGTCACCAGC GTGCCCTGGC AGCTCGCCTC GTCCAGGCTG
GCCGAGCAGC GGTTCGAACG AACCGGGGCG GGCGCGGTGC GACAGGGCCG GGCCACGCTT
CCCGGCCCGT TGGAGCGGAA GCGGTGA
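
The GC content field can be recomputed from a sequence printed in this 10-base block layout. The sketch below is one possible way to do it, not the method used by the source database: `gc_percent` is a hypothetical helper that strips whitespace and counts G and C bases. The example input is only the first line of the gene sequence, so its result is not expected to equal the whole-gene value of 69%.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence above (60 bases), used as sample input.
first_line = "GTGGCGCTCA CGGACCGCCT CAAGCGCGGT TTCGTCGGTC GACCGATCGG GAGCGACCGG"
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence as one string (line breaks and spaces included) would give the figure to compare against the GC content field.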
 
Protein sequence
MALTDRLKRG FVGRPIGSDR LGETLLPKRI ALPVFASDAL SSVSYATEEI LLVLSLGGLA 
FYHISPWLAG AVAILMLTVV ASYRQNVHAY PSGGGDYEVV SVNLGPRAGL LVASSLLVDY
VLTVAVSVSA GVANLTSAIT GLAAHKVLLA VVIVVLLTIM NLRGVRESGT AFAIPTYGFV
LGIFVMIVTG LVQAAVGHPP RAESAGYQVV AERDYAGFAL VFLVLRAFAS GCTALTGVEA
ISNGVPAFRK PKSRNAATTL LMLGLIAVTM FGGVTALALI SDVHVAEHTG DLIGASGEQR
TVIAQVAAAV FGDGSPGFGY IAVVTALILM LAANTAFNGF PVLGSILARD GYLPRQLYTR
GDRLAYSNGI VLLAGFAILL IVVFDAQVTA LIQLYILGVF ISFTLSQTGM VRHWARILRS
DDPAASDPAA RRRIRRSQAI NFFGACLTGT VLILVLVTKF THGAWIVCLA IPIIFLGMRG
IRAHYDRVAV ELTPEPGPPT LPSRIHAVVL VSKIHAPTLR ALAYAKASRP HSLVAVTVAV
DQAEADRLRK AWTERGITVD LVVLASPYRE VTRPVLDYVA RIRRESPRDV VAVYVPEYVV
GHWWEHLLHN QSALRLKARL LFQPSVMVTS VPWQLASSRL AEQRFERTGA GAVRQGRATL
PGPLERKR
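
The gene sequence begins with GTG while the protein begins with M, which is consistent with translation table 11, where GTG is one of the permitted initiation codons. The sketch below illustrates this under stated assumptions: it uses the standard amino acid assignments (which table 11 shares), treats any table 11 initiation codon in the first position as methionine, and stops at the first stop codon. `translate` and the constant names are hypothetical, and only the first and last lines of the gene sequence are used as input.

```python
from itertools import product

# Codon order T, C, A, G for each position, with the matching amino acids.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons of translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(sequence: str) -> str:
    """Translate an in-frame DNA string, ignoring whitespace."""
    dna = "".join(sequence.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for index in range(0, len(dna), 3):
        codon = dna[index:index + 3]
        # Simplification: an initiation codon in first position reads as M.
        if index == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


gene_start = "GTGGCGCTCA CGGACCGCCT CAAGCGCGGT TTCGTCGGTC GACCGATCGG GAGCGACCGG"
gene_end = "CCCGGCCCGT TGGAGCGGAA GCGGTGA"

print(translate(gene_start))  # MALTDRLKRGFVGRPIGSDR
print(translate(gene_end))    # PGPLERKR
```

Both outputs match the corresponding ends of the protein sequence above. The last line of the gene sequence can be translated on its own because the 33 preceding lines hold 60 bases each, so it starts in frame.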