Gene Franean1_6552 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6552 
Symbol 
ID5674867 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7968528 
End bp7969730 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content71% 
IMG OID641245401 
Producthypothetical protein 
Protein accessionYP_001510795 
Protein GI158318287 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.413404 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTCA CGGCGGAAGA CCCGGTGGCG GCGTCCCAGG ACGTGGTGCT CACCCTGTTC 
AGAGAAACGC TCCAGGATAT GGCGATCAGG GCGTATATGC GTCCACCCGA CCGCCTGCTC
ATCACCCTGC TGCGCTCGTC ACGGGTGAGC CGGGTCCTGG TGGCGGAGCC GTTCCGCAGC
CTGCTCGGCA CGATGGTTCG GGGTGGGCGG ATGGTCGCCC TGCCGCCGGC CAGCGGACCC
GAGCGGTACC TGGTGTCGCC GCAGCGGTGG CGACGGGACG ATCCGGCGTC GCTGCCGCTG
GTGCGCAGCA CCTACCGTCG CTACGACGCC GCGCTGCGCC GGCGCGTGGC CCAGGCCAGC
TGTGAGCGGC CCGTCCTGAT TACCACGAAC CCCCTGGTGG CGGGCTTTGC CGAAGCGGAA
TGGGCGAACT CGGTCGTGTA TTTCGCGCGG GACGACTGGG CGTCGTCCCC GCCGCTGCGG
CGGTGGCATC CGGCATTCCG CCGGGCCTAT GTGGAGATCC GCCGACGCCG ACGGCCGGTC
ATCGCGGTCT CCCGGCCGCT CCTGGAGCGC ATAGACCCGA CCGGCGAGGG CCTGGTCGTC
CACAACGCCG TCGATCCGGC TGAATGGCGG CGTCCACCGG CCCCGCCGGA GTGGCTTCAG
CGGCTCCCGC GGCCGTGGTG TGTATATGCC GGCAGCGTCG ACGACCGCCT CGATCTGGAC
CTGGTTCGAC GCCTGGCCTC GGCCGGCACT GTGGTTCTGG CCGGCCCGGT CGAGCGCGAG
GAACACGTCA GACCGCTGCG GTCGGTTCCC TCGGTGCACC TGCCGGGCCA TCTGCCGCGG
CCGGTTGTCA CGGGAGTGAT CGCTGCGGCC GACGTGTGTC TGCTCACGCA CCGACGCACC
CCGCTCACCG AGGCAATGGA CCCTATCAAG ATCTACGAAT ACCTCGCGGC CGGCTGTCCT
GTGATCGCCA CGGACCTCAC CCCCGTCCGT GACATCAGCC CGCGGGTCCG GCGGCTGGGG
CCCGGGGAGG ATCCGGTGTC CGTGCTGCGC GAGGTTCTTG CCTGGCCCGC AGTTGACGAG
GCCGAGCGGC TGGCCTTTGT CGACCGCAAC AGCTGGGCAT CCCGGCATGT CAGCCTGCTC
CACTTTGCCC TCGGCGGGTC CTCGTCTCCC GTACCGGCCG GCCCCCTGCC GCTACACGCC
TGA
 
Protein sequence
MTVTAEDPVA ASQDVVLTLF RETLQDMAIR AYMRPPDRLL ITLLRSSRVS RVLVAEPFRS 
LLGTMVRGGR MVALPPASGP ERYLVSPQRW RRDDPASLPL VRSTYRRYDA ALRRRVAQAS
CERPVLITTN PLVAGFAEAE WANSVVYFAR DDWASSPPLR RWHPAFRRAY VEIRRRRRPV
IAVSRPLLER IDPTGEGLVV HNAVDPAEWR RPPAPPEWLQ RLPRPWCVYA GSVDDRLDLD
LVRRLASAGT VVLAGPVERE EHVRPLRSVP SVHLPGHLPR PVVTGVIAAA DVCLLTHRRT
PLTEAMDPIK IYEYLAAGCP VIATDLTPVR DISPRVRRLG PGEDPVSVLR EVLAWPAVDE
AERLAFVDRN SWASRHVSLL HFALGGSSSP VPAGPLPLHA