Gene Franean1_6445 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6445 
Symbol 
ID5674760 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7836951 
End bp7838834 
Gene Length1884 bp 
Protein Length627 aa 
Translation table11 
GC content76% 
IMG OID641245293 
Producthypothetical protein 
Protein accessionYP_001510688 
Protein GI158318180 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.131326 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0967044 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGGACAA CCGGCACCGC GGCGGATCCC GCTGACCCCG CCGAGCCGAG CGACCCCCTC 
CCCGCGGATC CCCTCCCCCC TGATCCCGTC CCCGCTGATC CCGTCCCCGC CGGTGCGCGG
CGGGACGGGC CGCGGCCGAG TCACCGGCAC CGACGTCAGC GGCGCCGCCG GCGCCCGCTG
GGCCGCGCGG GCCTGGTCCC GGCGGCGGTC CTCGCGGGGT ACCTCGCGCT CGCTGTCGCC
GTCTTCCGTG CCGCGTGGGC CGATCCCGGC GGCGTGGTTT ACGGCTACAG CGACTCGGTG
CTGTTCGCGT GGTACCTGGG CTGGGTCCCG CACGCCCTGT CCGCAGGCAT CGACCCGTTC
GTCACCTCGT ACCTGAACGC CCCGACCGGC ACGAACATCC TGTGGAGCAC GCCCGTCCCG
CTGCTCGGGC TGGTGACCGC GCCGGTGACC GCGCTGTTCG GCCCCGTCGT CTCGCTCACC
CTGCTGCTGA CACTGGCGCC GGCGCTCTCC GCGTTCGCCC TGTTCTGGGT GCTGCGGCGC
TGGGTGCCGG CGCCGCCGGC CGCCGTCGCC GGCCTGCTCT ACGGCTTCGG CCCGTACATG
GTCGGCGAGT CGTACGGGCA TCTGCATCTC ACCTTCGCGG TCTTCCCGCC GCTGCTGCTG
CTCCTGCTCG ACGACCTGAT CGTGCGCCGG CGCCCACCGG GGCGCACCGG CGTGCTGCTC
GGCCTGGCGG TCGCTGCCCA GGCCATGATC AGCGAGGAGG TGCTGGCCAC CGCCGCGCTG
CTCGGCGCGC TCGGGCTCGC CATCGCCGGG CTGGCCCACC GCGCCGCCGT CCGCGCGCGG
GCGGGTGCGC TGCTGCGGGG CCTGGCGGCC TGCGGCACGA CGGCCGGGAC GCTGCTCGCC
TGGCCGCTGA CCGCGCAGTT CCTCGGCGAC CAGCGGGTCC ACGGCAACAT CCAGCCGCAC
AACGTCGCGG TGTCCGATCT GCTGACCTTC GTCACGCCGA CCCCAGCCCA GCGGATCGCA
CCCGACGTGG CGCTGCGGCA CAGCCTGCGT TTCACCGGCA ACGCGGTCGA GGTGACCGGC
TACCTCGGGC TTCCCCTGCT GCTCGGGGTG GCCGCGATCG CCGTCCGGTT CCGCCGCGAG
CCGCTGGTGG CGGTGTTCGC CCCGCTCGGC GCGGTGACGG CGCTGCTCTC CCTGGGCGGC
CACCTGCACG TGGACGGACG GGTCACCGGC ATCCGCCTGC CCTGGCTGCC GCTGGAGAAC
CTCCCGGTGA TCAGCAGCGC GCTCCCGTCC CGGCTCGCGC TGTATCTGGC GATGTCCGTG
GCGATCGTCC TCGCGGTGGG CCTGACCCGC GTCGCCGCGT CCGCCCGGTT CCCGCGGCCG
GTCACCCGGG CCGGGCTCGT GCTGCTCACC GCGGTGATGC TGGCGCCGCT CGTGCCGCGC
AGCCACGTGG CGACACCGGC CGCCACGCCC GCCTTCTTCA CCGGCGACGC CGTCCGCGCG
GTTCCCGAGG GATCGACGGC GCTGGTGCTG CCCTACCCCT ATCCGGCCCG CACCGAGGCG
ATGCTCTGGC AGGCCGAGGC GGGCTACCGG TTCCGGCTCC CGGGCTGCTA CTGCACCGTC
CCGGGCCCGG ACGGGCGCGC CGTCTTCAAC GCGTGGACCG ACCCGCTCAA CGGCGCGCTG
GTCGCGGTCG AGCAGGGCCG GTCGGACGCG GCGGCCGCGC TGGCCGATCC CGCCGTGCAG
GCCGCCTTCG ACCGGCTCGC GCCCGCCGCG GTGATCCTCG GCCCGAGCGC GAACCGGGAC
GAGCTCGCCC GGCTGGTGAC CGGCCTGGCC GGCGCCGGGC CGGCGGACGT CGACGGGGTC
CAGCTGTGGC TGACAGCGCC CTGA
 
Protein sequence
MRTTGTAADP ADPAEPSDPL PADPLPPDPV PADPVPAGAR RDGPRPSHRH RRQRRRRRPL 
GRAGLVPAAV LAGYLALAVA VFRAAWADPG GVVYGYSDSV LFAWYLGWVP HALSAGIDPF
VTSYLNAPTG TNILWSTPVP LLGLVTAPVT ALFGPVVSLT LLLTLAPALS AFALFWVLRR
WVPAPPAAVA GLLYGFGPYM VGESYGHLHL TFAVFPPLLL LLLDDLIVRR RPPGRTGVLL
GLAVAAQAMI SEEVLATAAL LGALGLAIAG LAHRAAVRAR AGALLRGLAA CGTTAGTLLA
WPLTAQFLGD QRVHGNIQPH NVAVSDLLTF VTPTPAQRIA PDVALRHSLR FTGNAVEVTG
YLGLPLLLGV AAIAVRFRRE PLVAVFAPLG AVTALLSLGG HLHVDGRVTG IRLPWLPLEN
LPVISSALPS RLALYLAMSV AIVLAVGLTR VAASARFPRP VTRAGLVLLT AVMLAPLVPR
SHVATPAATP AFFTGDAVRA VPEGSTALVL PYPYPARTEA MLWQAEAGYR FRLPGCYCTV
PGPDGRAVFN AWTDPLNGAL VAVEQGRSDA AAALADPAVQ AAFDRLAPAA VILGPSANRD
ELARLVTGLA GAGPADVDGV QLWLTAP