Gene Franean1_0754 details


Gene Information

Locus tag: Franean1_0754
Symbol:
ID: 5669170
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 879921
End bp: 881171
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 74%
IMG OID: 641239681
Product: hypothetical protein
Protein accession: YP_001505118
Protein GI: 158312610
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
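
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function name `check_cds_lengths` is hypothetical.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check CDS coordinates against the reported lengths.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that does not contribute a residue to the protein.
    """
    span = end_bp - start_bp + 1      # inclusive span on the replicon
    codons = gene_length_bp // 3      # whole codons, stop codon included
    return {
        "span_matches": span == gene_length_bp,
        "multiple_of_three": gene_length_bp % 3 == 0,
        "protein_matches": codons - 1 == protein_length_aa,
    }


# Values taken from the Gene Information fields of this record.
result = check_cds_lengths(879921, 881171, 1251, 416)
print(result)
```

Under those assumptions all three checks hold for this record: 881171 - 879921 + 1 = 1251 bp, and 1251 / 3 = 417 codons, one of which is the stop codon.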


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.200233
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000521831
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCCGAGC CCGACCCCCT CTCCGCCCCC GCGCGCGACG TCGTGTTCAC CCTGTCCAGG 
GAGACGCTGC GGGACATGGC GATCCGCTCC TACATGCGGC CGCCGGACCG CGTCCTGCTC
ACCCTGATGC AGTCGCCGCG GGTGCGACGG CTGCTCGTCG CCGAGCCGTT CCGCAGCAGG
GTGACAGCGT TGGCGAAGGG CGACCCGGTG GTGGCCATTC CACCGACGAG CAGGCCGGAC
CGCTACGTCG TGTCCGCGCG CCGCTGGCGT CGCGAGGACC CGGTCTCGCC CGCGATGCTG
CGTCACACCT ACCGGCGCTA CGACCGGGTG CTGCGACGGG CCGCGGAGCA GGCCCGCTGC
GAGCGGCCCG TGGTCGTCAC GACGTACCCG CTGCTGGCCG GGCTCGCCGA GCTGGAATGG
GCCGACTCGG TCGTCTACTT CGCCCGTGAC GACTGGGCGA CCTACCCCCC GCTGGAACGC
TGGCACCCGG CGTTCCAGGA GGCCTACGCC GAGATCCGCC GCCGGCGGCG GCCGGTGGTC
GCGGTGTCGG AGTCGCTGCG CCGGCGGCTC GCTCCCACCG GAGGCTCACT GGTCGTCCAC
AACGGCGTCG ATCCCGCCGA GTGGGAGCGG CTGCCGCCGG CACCCGCGCT GATCGCGAGC
CTGCCCCGGC CGTGGTGCGT CTACGCGGGC ACCGTCGACG ACCGGCTCGA CGTCGACATG
GTCGCCCGGC TGGCGACGGA GAGCACGGTG ATACTCGCCG GGCCCGTCAA GGACGAGCGA
CACGCCGCCC CGCTGCGGGC GTTGCCCTCG GTCGTCCTGC CAGGCCATCT CCCCCGGCCG
GCCGTCACCG GGCTGATCGC CGCGGCGGAC GTGTGCCTGC TACCGCACCG GGTGACCTCG
CTGACGGAGG CGATGGATCC GATCAAGCTG TACGAGTACC TGGCGGCGGG GCGGCCCGTC
CTGGCGAGCG ACCTGACACC GGTCCGGGGC ATGGGACCGC GGGTCAGGCT GCTGGCGCCG
GGCGACGATC CGGTGGCCGC GTTCAGGGAG GTCCGCGGCT GGCCGGAGGT GACCGAAGCG
GAACGCCACC GGTTCGTCGC AGCGAACAGC TGGTCAGCCC GTCATATGGA GCTGCTGGAC
TTCGCTCTTG GCGGCGATCC GGAGCGGCAG CGGCGGCCCG CGGCCCGGCC GGCCCCGCCG
ATCGAAGGCC ACGCTAGGGC GACGTCAGCG AGGGCCGGCG AGGCGACGTG A
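
A GC content figure such as the one reported above could be recomputed from the gene sequence along the following lines. This is a minimal sketch, not the method used to produce the record: the helper name `gc_content` is hypothetical, and it assumes the sequence contains only A, C, G and T, with whitespace used purely for grouping into blocks of ten.

```python
def gc_content(sequence):
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (spaces and line breaks between blocks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First ten-base block of the gene sequence, used as a small example.
first_block = "ATGCCCGAGC"
print(gc_content(first_block))  # 0.7
```

Passing the full gene sequence (all blocks, including line breaks) to the same function gives the overall fraction, which can then be compared with the reported percentage.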
 
Protein sequence
MPEPDPLSAP ARDVVFTLSR ETLRDMAIRS YMRPPDRVLL TLMQSPRVRR LLVAEPFRSR 
VTALAKGDPV VAIPPTSRPD RYVVSARRWR REDPVSPAML RHTYRRYDRV LRRAAEQARC
ERPVVVTTYP LLAGLAELEW ADSVVYFARD DWATYPPLER WHPAFQEAYA EIRRRRRPVV
AVSESLRRRL APTGGSLVVH NGVDPAEWER LPPAPALIAS LPRPWCVYAG TVDDRLDVDM
VARLATESTV ILAGPVKDER HAAPLRALPS VVLPGHLPRP AVTGLIAAAD VCLLPHRVTS
LTEAMDPIKL YEYLAAGRPV LASDLTPVRG MGPRVRLLAP GDDPVAAFRE VRGWPEVTEA
ERHRFVAANS WSARHMELLD FALGGDPERQ RRPAARPAPP IEGHARATSA RAGEAT
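
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is an illustration under stated assumptions rather than the pipeline behind this record: the names `CODON_TABLE` and `translate` are hypothetical, and the codon table is built from the standard amino acid assignments, which translation table 11 shares (table 11 differs in which codons may act as start codons, and that does not matter here because the gene begins with ATG).

```python
from itertools import product

# Standard amino acid assignments in TCAG codon order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence):
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace between blocks is ignored; trailing bases that do not
    form a complete codon are skipped.
    """
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGCCCGAGC CCGACCCCCT CTCCGCCCCC"))  # MPEPDPLSAP
```

The output matches the first block of the protein sequence above; the final codon of the gene sequence, TGA, is a stop codon and so contributes no residue.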