Gene Franean1_5254 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5254 
Symbol 
ID5673588 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6316086 
End bp6317513 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content72% 
IMG OID641244109 
ProductVWA containing CoxE family protein 
Protein accessionYP_001509518 
Protein GI158317010 
COG category[R] General function prediction only 
COG ID[COG3552] Protein containing von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.697295 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.473952 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCTGC TCGACCGCAC CGTCGAGTTC ACCGCCGCGC TGCGCCGCGC CAACGTCCCG 
GTGAGCAGCG CCGAGACGGT GGACGCCGCG CGGGCGGTCG GTGCCATCGG CTGGGCCGAC
CGGGACGCCC TGCGGGCCGC GTTCGCCGCG ACGATGTGCA AGCGCCCGCT GTACCGGAGC
GCGTTCGACT CGCTGTTCGA CCTGTACTTT CCGCCGCGGA TCGGCGACGG TGTCGTGCTG
CCCGACGAGG GCGGGCCCGC CGGGGAGCAG CAGCCGGGGG AGCAGCGGGA CGGCGACCCG
TCCGAGCTGA CCCCGCAGGA GATCGAGGCG CTGCGCCGGG CCATGCGCGA CCAGCTCCGC
GACGCGCTGC TCGACGGCGA CGACGAGAAG CTCCGCGACC TGGCCCGCCG TTCGGTCAGC
GCCTTCGGCG CGGCGCAGAA CGGGCCGGGG CAGCGCAGCT ACTTCATCTA CCGGGTGCTG
CGGGCGATGT CCCCGGAGAC ACTCATCGCG GACCTGCTCG CCGCGATGCT CGGCGACGAC
GATCGCGGCG GCCTCGAAGA GCGGATCGCG CGGCAGACGA TCGCCGACCG CATCCGGGCG
TTCGAGGAGA TGATCTCCTC CGAGGTCCGC CGGCGGATGG CCGAGGAACG CGGCATCGAG
GCCGTCGAGC GGACCGCGGT GAAGCCGCTC GCCGACCAGG TCGACTTCCT GCGTGCCTCC
CAGCGTGATC TGGTCGAGCT GCGCCGCCAG GTGTATCCGC TCGCCCGCCG GCTGGCGACC
AGGCTGACGG CGCGGCGCCG GCTCGGCCGG GCCGGCCGGC TCGACTTCCG CCGCACCGTC
CGGGCATCGC TGGCGACCGG TGGCGTGCCG ATCGAGACCA AGCACCGGCC GCACAAGCCG
CACAAGCCCG AGCTTGTCGT GCTGTGCGAC GTGTCCGGAT CGGTGTCCTC CTTCGCGCAC
TTCACCCTCA TGCTCACGCA CGCGCTGCGC GAGCAGTTCT CCAAGGTCCG CGCGTTCGCG
TTCATCGACA CGACCGACGA GGTCACCCGC TTCCTGCGCG GGCTCGAGCT GGGCGACATG
ATGGCCCGGA TCGCCTCCGA GGCCGACCTG GTCTGGTTCG ACGGCCACAG CGACTACGGC
CACGCGATCG AGGTCTTCGC CGAGAAGTAC CCGGACGCAG TCGGGCCGCG GACGTCGCTG
CTCGTCCTGG GGGATGCCCG CAACAACTAC CGGGCCACCT CGGCCGCGGT GTTCCGCCGG
CTGTGCGGGC AGGCCCGGCA CTCCTACTGG CTGAACCCGG AGCCGCGCAG CTACTGGGGC
TCCGGCGACT CGGCCACCAC CGCCTACGCG GACCTCGTCG ACGAGATGGT CGAGTGCCGC
AACGTCGAGC AGCTCCAGCA CTTCATCGAG CGTCTGCTAC CCACCTGA
 
Protein sequence
MNLLDRTVEF TAALRRANVP VSSAETVDAA RAVGAIGWAD RDALRAAFAA TMCKRPLYRS 
AFDSLFDLYF PPRIGDGVVL PDEGGPAGEQ QPGEQRDGDP SELTPQEIEA LRRAMRDQLR
DALLDGDDEK LRDLARRSVS AFGAAQNGPG QRSYFIYRVL RAMSPETLIA DLLAAMLGDD
DRGGLEERIA RQTIADRIRA FEEMISSEVR RRMAEERGIE AVERTAVKPL ADQVDFLRAS
QRDLVELRRQ VYPLARRLAT RLTARRRLGR AGRLDFRRTV RASLATGGVP IETKHRPHKP
HKPELVVLCD VSGSVSSFAH FTLMLTHALR EQFSKVRAFA FIDTTDEVTR FLRGLELGDM
MARIASEADL VWFDGHSDYG HAIEVFAEKY PDAVGPRTSL LVLGDARNNY RATSAAVFRR
LCGQARHSYW LNPEPRSYWG SGDSATTAYA DLVDEMVECR NVEQLQHFIE RLLPT