Gene Franean1_4899 details

Gene Information

Locus tag: Franean1_4899
Symbol:
ID: 5673239
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5882087
End bp: 5883853
Gene Length: 1767 bp
Protein Length: 588 aa
Translation table: 11
GC content: 74%
IMG OID: 641243754
Product: hypothetical protein
Protein accession: YP_001509170
Protein GI: 158316662
COG category: [S] Function unknown
COG ID: [COG5305] Predicted membrane protein
TIGRFAM ID:
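
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count of a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length(5882087, 5883853)
print(length, protein_length(length))
```

Under these assumptions the record is self-consistent: the coordinates give 1767 bp, and 1767 bp corresponds to 588 residues plus a stop codon.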


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0731796
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0477636
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGACCGACG TCGCGGTGCG GCCGGCGGAA CACACCCCCG AGCCGGTCGA CAGGGTCGGA 
GCTCGACCCG ACCGCGAGCA CCGGATCTTC CGGGGCGCGC TGGTGCTGAT CATCGCTGCG
GCGGTGGTGG TGCGCTTCAT CGCACGCCAG CCGCTGTGGC TCGACGAGGC ACAGAGCGTG
GCGATCGCCC GCCTGCCGCT GTCGGGTGCG GCGCCCACCA TGTGGGACGG CCTCCTCCAG
GACGGCTCGC CGCCCCTCTA CTACATCCTG CTGCACCTGT GGATCAGCGT CTTCGGCGAC
GGCACCGCGT CGGTGCGCGG GATGTCCGCC GTCATCAACC TGGGCTCCGC GGTGCCGGTG
TTCTATCTCG GACGCCGGCT CGTCGGTGAC CGCGGCGCGA AGGTCGCGGT GGTGCTGTAC
CTCACCTCGC CGTTCGCGCT GTACTTCGGC ACCGAGACCC GGATGTACAG CCTGATCGTG
CTGCTCACGG CGCTCGGCGG GCTGGCCCTG GAGCGGGTGC TGCGGGTGCC GTCGGTGAAG
AACGTGGTGC TGCTGGCCGT GGCGTCGGGA TGCCTGGCCC TGACGCACTA CTGGTGCCTC
TACCTGCTGA TGACCGTGGG TGCGTGGCTG GTCGGGCTCG TCTTCGTCCG GCCGCGGGTG
GCGGCGGCGC GCGCCGCCCG CCGCGGTCGC GCCGGGCGTT CCGGGGCGCA CTCGCGCGGC
GGGCGCCGTC CGGGCGGGCC GGCGCCGACG CCGGCTGGCG CATCCGAGCA GCCCACGGCC
GCGCCCGAGC TGCTCGCCGA CGCCGCCGGC GGCGCACGCC GGGCCGTGCC CACCTGGCAC
CCCCGCGGCC CGCTCGCCGG GATCATCGGG ATCATCGCCG GCGGCCTGGT GTTCGCTCCG
TGGCTGCCGA ACTTCCGCTC CCAGCTCGCC CACACCGGGA CGCCCTGGGG CGAGCCGGCG
AGCTTCGCCG CCGTCAGCCA CGCGTACGGG CAGTGGGCCG GTGGGCCGAC CACACTCGGC
CGGCTGTTGC TGTTCCTGAT CACCGGCCTG GTCGCCGCCG GGATCGCCGG CCGTCCACTG
GGCGGGCGGT TCGTCCTGCT CGACCTGAAG GGCCTCGAGC CGGGCCGCAC GCTGTTCTTC
CTGGCCACCG GGACGCTGAT CGTCGCCGTG GCGGCCGGCA AGCTGGTGGG CAACGCGTGG
GCCGACCGGT ACACCGCGAC GGCGTTCGTC CCGTTCCTGC TCGTGGTGGG ACTGGGCGCG
ACAATGCTGG CCGACCGCCG GGTGTTCCAC GGCGTGGTCG CGGTCGCGGC GCTCGTCGGT
GTCATCGCCG GGACGAGCGA CGTGCGGCGG GAACGGTCCC AGGCCGGCGA GGCCGCGGCC
GTCCTCACCC GGCTGTCCAG GCCGGGCGAC ATCCTGCTGG TGTGCCCGGA CCAGCTGGGG
CCGGGCCTCG CGCGCACCGT GCCGTCCTGG CTCAAGGTGT ACGTGGTGCC GACCTACGCC
GCCCCGGACC GGGTCGACTG GGTCGACTAC GAGGAGCGCA ACGAGTCCGC CAACGGCGTG
GCGATCGCCC AGCGGGCGAT CGCGGAGGCC GGGACGAACA CCGTGTTCAT GGCCGGCTCC
GGCGCCTACC GGACCTACGA GGTGCTGTGC ACCCAGGTCC GGGCGACGCT GCAGACGCAG
CGGCCGGTGG CCGACGAGGT GATGGAGCAG GGTCTGCCGG CACGGGTCTA CGAGAACTAC
GCCCTGCTGC GGTTCCGGGC GTCGTGA
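
A GC percentage such as the one listed under Gene Information can be recomputed from the gene sequence. The sketch below is one plausible way to do so, assuming the figure is simply the share of G and C bases over the full gene length; the record does not state how the database derives or rounds its value, and `gc_content` is a hypothetical helper name.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases; whitespace between 10-base blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence; paste the full sequence to check the whole gene.
first_row = "GTGACCGACG TCGCGGTGCG GCCGGCGGAA CACACCCCCG AGCCGGTCGA CAGGGTCGGA"
print(round(gc_content(first_row), 1))
```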
 
Protein sequence
MTDVAVRPAE HTPEPVDRVG ARPDREHRIF RGALVLIIAA AVVVRFIARQ PLWLDEAQSV 
AIARLPLSGA APTMWDGLLQ DGSPPLYYIL LHLWISVFGD GTASVRGMSA VINLGSAVPV
FYLGRRLVGD RGAKVAVVLY LTSPFALYFG TETRMYSLIV LLTALGGLAL ERVLRVPSVK
NVVLLAVASG CLALTHYWCL YLLMTVGAWL VGLVFVRPRV AAARAARRGR AGRSGAHSRG
GRRPGGPAPT PAGASEQPTA APELLADAAG GARRAVPTWH PRGPLAGIIG IIAGGLVFAP
WLPNFRSQLA HTGTPWGEPA SFAAVSHAYG QWAGGPTTLG RLLLFLITGL VAAGIAGRPL
GGRFVLLDLK GLEPGRTLFF LATGTLIVAV AAGKLVGNAW ADRYTATAFV PFLLVVGLGA
TMLADRRVFH GVVAVAALVG VIAGTSDVRR ERSQAGEAAA VLTRLSRPGD ILLVCPDQLG
PGLARTVPSW LKVYVVPTYA APDRVDWVDY EERNESANGV AIAQRAIAEA GTNTVFMAGS
GAYRTYEVLC TQVRATLQTQ RPVADEVMEQ GLPARVYENY ALLRFRAS
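
The protein sequence can be reproduced from the gene sequence with translation table 11 (the NCBI bacterial, archaeal and plant plastid code). The sketch below is a minimal standalone translator, not the tool the database uses. It assumes the sequence is given in the coding direction and treats an alternative start codon such as GTG as methionine only at the first position.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; third base varies fastest) mapped to
# the table 11 amino acids, which match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by table 11; each encodes Met when initiating.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("GTGACCGACG TCGCGGTGCG GCCGGCGGAA"))  # MTDVAVRPAE
```

The same function applied to the final row of the gene sequence (which is in frame) yields the C-terminal residues ALLRFRAS and stops at the TGA codon.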