Gene Franean1_5130 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5130 
Symbol 
ID5673464 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6145802 
End bp6146902 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content77% 
IMG OID641243980 
Producthypothetical protein 
Protein accessionYP_001509394 
Protein GI158316886 
COG category[R] General function prediction only 
COG ID[COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.355874 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0671024 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGCTC CCCGCGCGGA TCCGGCCGCG CCCCAGCCCG CGGCGACGTC GCCGTCGGTC 
GAGCGGACGC TGCGCAGCCT CGAACTCACC GTGACCCGCC GGCTGGACGG CATGCTGCTC
GGCGATCATC TCGGCCTGCT GCCCGGCCAG GGCACCGAGA AGGCCGAGAG CCGGGAGTAC
AACGTCGGCG ACGACGTCCG CCGGATGGAC TGGGCGGTCA CCGCGCGGAC GACCGTCCCG
CACGTGCACG ACCTGATCGC CGACCGGGAG CTGGAGACGT GGGCGCTGGT CGACCTGACG
GCCAGCCAGG AGTTCGGCAC CGCCTCGGTC CGCAAGCGCG ATCTGGCGAT CGCCGCGGTG
GCGGCGATCG GCTTCCTCAC CGCCCGCACG GGCAACCGGA TGGGAGCCGT GGCCCTCACC
CCGGCCGGGC CACGGGTCAT CCCCGCCCGG CCCGGCCGCC AGGGCCTGCG AACGCTGCTG
CGGACCCTGC TGACGGTCCC CGAGGGGGCG CACGACCGGC CGCTGCGCCG GCCCGACCCG
GCGGCCGCCA CCGATCTCGC CGCCGCAATC GCCGCCCTGG ACCGCCCGCG CCGGCGCCGT
GGCCTCGCGG TGGTCGTCAG CGACTTCCTC TCCACCGACC TCGGCTGGGA ACGGCCGATG
CGCGTCCTCG CGGCGCGCCA CCAGCTCCTC GCGGTCGAGG TCCTCGACCC GGCCGAGCTG
ACGCTGCCCG CCGTGGGCCT GCTTCCGGTC GTGGACGCGG AGACCGGCGA GCTGGTGGAG
GTTCCGACGT CCTCACGGCG ACTGCGTGAG CGCTACCGCC TGGCCGCGGC CGAGCACCGC
TCCCAGGTCG CCCTCGCGCT GCGCCGGGCG GGCGCCGGGC ACCTGGTGCT GCGCACCGAC
TCCGACTGGC TGATCGACAT CGTCCGCTTC GTCTCGGCGA GCCGGACGAG CCGCGGCGCG
GCACGACGCC CACCCGTGGA CTCGACCCGG CTACCGGGCC ACCCGCGATC GCTCCCGCCG
GCCACCGGCC GGGGTCGACC CGGGACAGCA GCTGTGGTCG GGGCAGGCGG CCGTCGAGGC
AGGCGGGCGG CGGCGCCGTG A
 
Protein sequence
MTAPRADPAA PQPAATSPSV ERTLRSLELT VTRRLDGMLL GDHLGLLPGQ GTEKAESREY 
NVGDDVRRMD WAVTARTTVP HVHDLIADRE LETWALVDLT ASQEFGTASV RKRDLAIAAV
AAIGFLTART GNRMGAVALT PAGPRVIPAR PGRQGLRTLL RTLLTVPEGA HDRPLRRPDP
AAATDLAAAI AALDRPRRRR GLAVVVSDFL STDLGWERPM RVLAARHQLL AVEVLDPAEL
TLPAVGLLPV VDAETGELVE VPTSSRRLRE RYRLAAAEHR SQVALALRRA GAGHLVLRTD
SDWLIDIVRF VSASRTSRGA ARRPPVDSTR LPGHPRSLPP ATGRGRPGTA AVVGAGGRRG
RRAAAP