Gene Franean1_1553 details

Gene Information

Locus tag: Franean1_1553
Symbol:
ID: 5669956
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1858385
End bp: 1860865
Gene Length: 2481 bp
Protein Length: 826 aa
Translation table: 11
GC content: 77%
IMG OID: 641240472
Product: hypothetical protein
Protein accession: YP_001505898
Protein GI: 158313390
COG category:
COG ID:
TIGRFAM ID:
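
The Start bp, End bp, Gene Length and Protein Length fields are mutually consistent, and the arithmetic can be checked with a minimal Python sketch. The function name `cds_lengths` is hypothetical and not part of any database tooling. The sketch assumes 1-based inclusive coordinates, an unspliced CDS, and a stop codon counted within the gene length.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions: coordinates are 1-based and inclusive, and the CDS
    includes its terminal stop codon (which encodes no amino acid).
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon, so subtract it from the residue count.
    return gene_len, gene_len // 3 - 1


gene_len, protein_len = cds_lengths(1858385, 1860865)
print(gene_len, protein_len)  # 2481 826
```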


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.323576
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.438631
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGTCA CCGTCCTGGG CATCCGCCAC CACGGGCCGG GCTCCGCACG GGCGGTCGAG 
GCGGCGCTCG CCGAGCTCGA CCCGGACCTC GTCCTCGTCG AGGGCCCGGC GGAGGGCGAC
GCCGCCCTCG CGCACACCGG CTCGCTCACC CCGCCGGTCG CGCTGACGGT CTACGCCCGC
GACGAACCGC GCGACGCCGC CTTCTGGCCG TTCGCCGAGT TCTCCCCCGA GTGGCGGGCG
CTGCTGCACG GCGCCGCCAC AGGCGTCCCG GTGCGTTTCG TCGACCTGCC CTTCGGGCTC
TCCCTGGCGC TGCGGCGCCA GGAGCAGGCC GAGGCAGCGG CCAGGGCCGT CGACGCCGAT
GCTGAGGCCG GCGGCCCGGA CGGCGCCGGT TCGCGCAGCG CCAACCCGCC CGGTGCCAGC
TCCCCCGGTG ATGCCGCTGG CGGTGACGGC GCTGGCGGTG ACGGCGCGGG CGGCGAGGGC
GGCGAGGGCG AGCCCGCGCA CCTGGTCGAG GACGACCCGC TCGGATGGCT GGCACGGGCA
GCCGGGCACG ACGACCCGGA GCGATTCTGG GAGGACCTCG TCGAACACCG CCGGCTCGCC
CGGGCCGGTT CGGGTGAGCC GGGCCGGGAC GGCGCCCTCG AGCTGTTCGC CGCGGTTGCC
AACGCGATGA CCGAGCTGCG CGACGAGGCC GGCACGGGCG ACCGGGTGTC CCCCACGCGG
GTGGAGCACC GGCGGCGGGA GGAGATGCGC GAGGCGCACA TGCGCCTGGA GATCCGCCGG
GCCGGCTCGG GGGCGGACGC GGCCGCTCGG ATCGCCGTCG TCTGCGGGGC CTGGCACGTG
CCGGCGCTGA CCGGGGGGGC GGGGCGGACG TCGGCCACCG CCGACCAGCG CACGCTGCGC
GGCGTGCGAG CCGTGCGCAC CGACACGACG TGGGTTCCGT GGACGCACGC GCGGCTCGCC
GCGGAGTCCG GGTACGGCGC CGGGGTCGCG TCACCCGGTT GGTACCACCA TCTGTGGACC
GCGCGTAACC AGGTCACGAC CCGCTGGGTG ACCCGGGTGG CGGGGCTGCT GCGCGCCGCC
GACCTGCCCG CGTCCTCCGC CTCGGTGATC GAGACCGTCC GTCTCGCCGA GGCGCTCGCC
GCCGTGCGCG AGCGGGCCGT CCCCGGGCTG ACCGAGCTGA ACGACGCCGT GCTCGCCGGT
CTGTGCGGGG GCGACCCGGT TCCGCTGGCG GTGGTGCGCG AGCAGCTCGT CGTCGGGCGG
GTGCTCGGCG CCGTCGGCGA GGACGTTCCG ACCGTGCCGC TCGCCGCCGA CCTCGCGCGC
CTGCAGCGCC GGCTGCGGCT GCGGCCCGGG GCCGACGACC AGAAGATCCG GCTGGACCTG
CGCAAGGACG TCGACCGGGA GCGCGGCCAG CTGCTGCGCC GGCTGCGCCT GCTCGACGTG
CCGTGGGGGA CGCCCGCAGC GACGTCGGGC ACCGGGACCT TCGCCGAGGC GTGGACGCTG
CGCTGGGAGC CGGAGTTCTC CGTCGCGGTG GTCGCCGCCG CCCGGTACGG CTCGACGGTC
GCCGACGCGG CCACCGCGGT CATCACCGAA CGGACGGAGG CCGCCGCCGA CCTGCCGGCC
GTCACCGCCC TGCTCGAGGC CGCGGTGCTC GCCGGGCTGC CCGCGGCGAT GGCGGTCGTC
GCGGCCGGGC TGGAACGCCG CGCCGCCGGC ACCGGCGACG TCGCCCACCT GATGGCGGCG
CTCGCCCCGC TGGCCCGCAT CCACCGGTAC GGCGACGTGC GCGCGACCGA CACGACGGGC
GTCGCCGCCC TCGCCGAGAG CCTCATGGTG CGGATCTGCG CGGGGCTCCC GCCCGCCTGT
GTGAGTCTCG ACGACGACGC GGCGGACGCG ATGGCCGCCG CGATCGACAG CGCCGACGGG
GCGTTCCGAC TGATCGCCGA CCCCGAGCAC GTCGAACGCT GGCACGTCGC CGTCCGCGCC
GCCGCCGACA TCCACGGTGG CAACAGCCTG GTCAACGGCA GGTGCACGCG GATCCTCTCG
GACGCGGGTG ACATCGACCA GGCAGAGGTG GCTCTCCGCC TTGACCGGGC GCTGTCCGCG
GCGGGCGTCG CGCCGGCCGA CGCCGCCCGC TGGCTGGAGG GCTTCCTCGG CTCCACCGGA
TCGGTGCTGG CCCGGGACCC GCGGATGCTC GGGCTCGTCG ACGCCTGGCT GGCCTCGCTC
ACAGCCGACG CGTTCACCGT CGTCCTCGCG CCGTTGCGGC GGGTGTTCGC GGCCTTCACG
GCACCGGAGC GCCGCATGAT CGCCGAACGG GCCGGGTCGG GGCTGCCGCG GCCGGCCGGT
GGCGGCACCG GCCCGACGAC ATCCACGGGC GGTCCGGCGG CCTGGGACGC CGACCGGGTC
GCGCTCGTCC TGCCTGTCGT CGCGTCGCTC CTCACGATCC CCGGCCTGGA AAGGACGGCC
ACACATGACC GAGATGCGTA G
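
The gene sequence is displayed in space-separated blocks of 10 bases across multiple lines. As a minimal sketch, assuming the sequence text is pasted in as a plain string, the layout can be stripped and the length and GC fraction recomputed for comparison with the Gene Length and GC content fields. The helper names `clean_sequence` and `gc_fraction` are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Illustration on the first two 10-base blocks of the gene sequence only.
blocks = "ATGAGCGTCA CCGTCCTGGG"
seq = clean_sequence(blocks)
print(len(seq), gc_fraction(seq))  # 20 0.65
```

Applying the same two functions to the full sequence would give values to compare against the reported 2481 bp and 77%.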
 
Protein sequence
MSVTVLGIRH HGPGSARAVE AALAELDPDL VLVEGPAEGD AALAHTGSLT PPVALTVYAR 
DEPRDAAFWP FAEFSPEWRA LLHGAATGVP VRFVDLPFGL SLALRRQEQA EAAARAVDAD
AEAGGPDGAG SRSANPPGAS SPGDAAGGDG AGGDGAGGEG GEGEPAHLVE DDPLGWLARA
AGHDDPERFW EDLVEHRRLA RAGSGEPGRD GALELFAAVA NAMTELRDEA GTGDRVSPTR
VEHRRREEMR EAHMRLEIRR AGSGADAAAR IAVVCGAWHV PALTGGAGRT SATADQRTLR
GVRAVRTDTT WVPWTHARLA AESGYGAGVA SPGWYHHLWT ARNQVTTRWV TRVAGLLRAA
DLPASSASVI ETVRLAEALA AVRERAVPGL TELNDAVLAG LCGGDPVPLA VVREQLVVGR
VLGAVGEDVP TVPLAADLAR LQRRLRLRPG ADDQKIRLDL RKDVDRERGQ LLRRLRLLDV
PWGTPAATSG TGTFAEAWTL RWEPEFSVAV VAAARYGSTV ADAATAVITE RTEAAADLPA
VTALLEAAVL AGLPAAMAVV AAGLERRAAG TGDVAHLMAA LAPLARIHRY GDVRATDTTG
VAALAESLMV RICAGLPPAC VSLDDDAADA MAAAIDSADG AFRLIADPEH VERWHVAVRA
AADIHGGNSL VNGRCTRILS DAGDIDQAEV ALRLDRALSA AGVAPADAAR WLEGFLGSTG
SVLARDPRML GLVDAWLASL TADAFTVVLA PLRRVFAAFT APERRMIAER AGSGLPRPAG
GGTGPTTSTG GPAAWDADRV ALVLPVVASL LTIPGLERTA THDRDA