Gene Franean1_2503 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2503 
Symbol 
ID5670899 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2979523 
End bp2981442 
Gene Length1920 bp 
Protein Length639 aa 
Translation table11 
GC content79% 
IMG OID641241420 
Producthypothetical protein 
Protein accessionYP_001506841 
Protein GI158314333 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.116558 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGTCG GCTCCGGGCG TTCCGCGACG GCGCGGCTCG CCGCCTCGAC GGCGGCCACG 
ACGCTGCTCG TGGCCGCGGC GACCGTTGCC GCCGGGTGGC CGTACTGGGA GCGCCACCGG
CTGGCCACCG TCCCGAACGT GGGCGTGGCG GTGGTGCTGG TCGTCGCCGG CATGCTGCTG
CGCGGCGCCC CGTCGTCACG CCGGTGTGGG GCGCTGCTGG CCCTCGGCGG CCTGTGCTGC
CCGCTGACCT GGCTGATGAG CTGGGACGTC GGCCCGCTGC CGCTGATCGC CGTCCTAGGC
CAGTCCGCCT GCTGGCTGGC CTACGCCTGG GGGCTGCTGC TCTACCCGGG GAACACCCTG
CGCGGCCGGG CCGACCGGTG GTGGGTCACG TCCGCGGCGG TGATCCTGCT GGGCGGGCAG
GTGCTGACGG TCGTCATCTC CCGGCCGTCC TGGAACGGCT TCACCGACGG TGTGCTGTGG
CCCGCGCCGT TGCTCGTCGG CCGGGCCCGT TTCGAGGCGG CGTTGGACGT CCTGCTCGTC
ATCTACGTCC TGGTGCCCGT GACGTTCGTG CTGCTGGCGG TCGCCCGGAT CCGGGCGGCG
CGGGGCCTGG AGCGGGTCGT CGCCGGTCCG GTGCTGCTCT CGGCGGGCTT CGTGGCGCTC
GTCGCCGCGG TGACCTACCC GGGCCTGATG GCCGAACCCG AGCTGGGCCG CATCGAGGAC
GCGGTCGCTC TGCAGGGGGC CGCGTCGATC GTCGCCCCCG TCGTGCTGCT CGCCGTCGGC
GCCCGCCGCC GGCTGCTGCT GGCGACGACC GCCGACCGGT TCGGCCGGGA GATCGGGTCG
CCGACACCGG CGTCGGTGCG GGCGGCGCTG CGGGCCCTGC TGCGCGACGC CACCCTCGAG
GTCTACTACC GCCAGCCGGG CACATGCCGC CAGACGGACA CGGAGGTGTT GGTGGATCTG
CACGGCCAGG TGGTGGGCCT CCCCCCGGAG GACCACCGGG GGGAACGGTC GGACCGGCCG
GGCCCGGAAC AGCGTTGGTA CGTCCCGGTG CGGTCCGCGG CCGGCGCGCC GGTCGCGGTG
CTGAGCATCG ATCCCGCCCT GCGCCGCCAC CGGCGGCAGG TGACGGCGGC GCTGGCCGCG
GCGGCCGCGG CGCTGGAGCA GGCGCGGGCG CAGAGCGGCC TGCGCGCGCA GCTCGTCCGC
CTGGGTGAGG AACGCCGGCG GGCCGCGCGC ACGCAGGCCC ACGAGTGGGC GCTGGTCGGG
CGGGAGCTGG ACGACGGCGT CCGCCGGCGC CTGGCCGAGC TCGCCGCGGC GGCGGGGGAC
GTCGCCCGGA CGGTGTCGGA ACCGGCCACC GCGCGGGCCC TGGCGGAGAT CGGCGAGGGG
CTGCGCGCGG CGCACGGCGA GCTCGCCGGC ATCGCCCGGG AGGCCCATCC CGCCGTCCTG
GAGCGGGACG GCCTGCTCCC GGCGTTGGAG AGTCTCGCCG CGGGGCTGGG GCTGGGCGGC
CCGAGCCTGC TGCGGGTGCC CGCCGGCCGC TTCGACGCGA CGGCCGAGCG GGCGATGTAC
GCGGCGCTCG CCGCCGCGCT GCGCGCCATC GCGGCCGCCG CCGCGGCTCC ACCGCCGGAC
ACGGCCGCGG GGATGCCGGA GCCGGCCACG GCGGGGCCGG TGGTGTCAGA GCTGGCGGTG
GGGCCGGTGG CGGAACCGGC CGCGGTGGGG CCGACGGCGG GCAGGAGAGC GTGGGGGGCC
GCCCGGGTCG AGGTGCGCGC CGAGGGCGCG ATGCTCGTCG GCGAGGTCAC CTGCGCGGTG
CCGGTGGCCG GCGGGGTGCG CGCCGCCGCC GACCACGCCC GGGCGCTGGG CGGTTGGGTC
GCCGTACGCG GTGTCGCCGG CGGCGCCACC ACGACCCGGG TGACGGTCCC GTGCGGGTAG
 
Protein sequence
MPVGSGRSAT ARLAASTAAT TLLVAAATVA AGWPYWERHR LATVPNVGVA VVLVVAGMLL 
RGAPSSRRCG ALLALGGLCC PLTWLMSWDV GPLPLIAVLG QSACWLAYAW GLLLYPGNTL
RGRADRWWVT SAAVILLGGQ VLTVVISRPS WNGFTDGVLW PAPLLVGRAR FEAALDVLLV
IYVLVPVTFV LLAVARIRAA RGLERVVAGP VLLSAGFVAL VAAVTYPGLM AEPELGRIED
AVALQGAASI VAPVVLLAVG ARRRLLLATT ADRFGREIGS PTPASVRAAL RALLRDATLE
VYYRQPGTCR QTDTEVLVDL HGQVVGLPPE DHRGERSDRP GPEQRWYVPV RSAAGAPVAV
LSIDPALRRH RRQVTAALAA AAAALEQARA QSGLRAQLVR LGEERRRAAR TQAHEWALVG
RELDDGVRRR LAELAAAAGD VARTVSEPAT ARALAEIGEG LRAAHGELAG IAREAHPAVL
ERDGLLPALE SLAAGLGLGG PSLLRVPAGR FDATAERAMY AALAAALRAI AAAAAAPPPD
TAAGMPEPAT AGPVVSELAV GPVAEPAAVG PTAGRRAWGA ARVEVRAEGA MLVGEVTCAV
PVAGGVRAAA DHARALGGWV AVRGVAGGAT TTRVTVPCG