Gene Franean1_4163 details

Gene Information

Locus tag: Franean1_4163
Symbol:
ID: 5672518
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4947125
End bp: 4948198
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table: 11
GC content: 75%
IMG OID: 641243036
Product: cobalamin (vitamin B12) biosynthesis CbiX protein
Protein accession: YP_001508453
Protein GI: 158315945
COG category: [S] Function unknown
COG ID: [COG2138] Uncharacterized conserved protein
TIGRFAM ID:
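
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon; both are assumptions about this record's conventions, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4947125, 4948198)
print(length, protein_length(length))  # 1074 357
```

Both values agree with the Gene Length and Protein Length fields above.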


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.108946
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.438631
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACGCCA CCCCGCCGCA CCCCGGCACC GTCCCCACCG CCACCGTTTC GGATCCGGCG 
GGTTCCGCCG CCGGGCCGGC GCTGCTGATC GTCGGGCATG GCACCCGGGA TGAGGCCGGC
GCCGAGCAGT TCCGCCGGTT CGTCTCCCGG GTGCGCCAGC GGGCCGGTGG GCTGGCGGTG
GACGGCGGGT TCATCGAGCT GTCCGCCCCG CCGGTCGCCG ACGCGGTCAG CCGGCTCGTC
GACGCCGGCC ACCGGCGGCT CGGGGTCGTT CCGCTCACCC TCGTCGCCGC CGGGCATGCC
AAAGGTGACA TTCCCGGGTC GATGGCCCGG GAACGCGAAC GCCACCCGGG GCTGCGCTAC
GCCTACGGGC GTCCCCTCGG CCCGCATCCG ACGATCCTGC GGCTGCTCGC TGAACGTGTC
GACGTCCTCT GCCCGCCGGG CCGCCGGGAG CAGACCACGG TGCTGCTGGT CGGGCGGGGT
TCGACCGACC CGGATGCCAA CGCCGAGGTG TTCAAGGTGG CCCGGCTGCT GTGGGAGGGC
CGCGGCTACG GCGGGGTGGA GGTGGCGTTC ATCAGCCTGG CCGAGCCGTC GGTGCCGGCC
GGGCTGGAGC GGATCCACCG GCTCGGCGGC CGGCGGATCG TCGTCGTCCC CTACTTCCTG
TTCACCGGGG TGCTGCCGCG GCGCACCGTC GAGCAGGCGG CCGGCTGGGC GGGGGGGCAC
CCGGAGGTGG AACTGGCCTG CACGGGCCTG CTCGGCGACA GCGCCGGCGC CGGCGGGGAG
GGTGACGGGC TGGTCGAGCT GGTGTTGGAG CGCTACCGGG AGGCGCTCGG CGGGGACATC
CGGATGAACT GCGACACCTG CCTGTACCGG ATCGCGCTGC CGGGGTATGC GCACCGGGTC
GGGCAGGCGC AGACCCCGCA CGACCATCCC GACGACCCGT CGCACTCCCA CGGGCCGCAC
GGCCACCACC ACCATCCGCA TTCCCCTGAC CCGCACGGCC ACGGTGATCC GCACGCTGCG
GGTGGGTCGG TGGGACGGGC GGGGCTCGCG GTGACGCTGC GCGCCGACGA CTGA
 
Protein sequence
MNATPPHPGT VPTATVSDPA GSAAGPALLI VGHGTRDEAG AEQFRRFVSR VRQRAGGLAV 
DGGFIELSAP PVADAVSRLV DAGHRRLGVV PLTLVAAGHA KGDIPGSMAR ERERHPGLRY
AYGRPLGPHP TILRLLAERV DVLCPPGRRE QTTVLLVGRG STDPDANAEV FKVARLLWEG
RGYGGVEVAF ISLAEPSVPA GLERIHRLGG RRIVVVPYFL FTGVLPRRTV EQAAGWAGGH
PEVELACTGL LGDSAGAGGE GDGLVELVLE RYREALGGDI RMNCDTCLYR IALPGYAHRV
GQAQTPHDHP DDPSHSHGPH GHHHHPHSPD PHGHGDPHAA GGSVGRAGLA VTLRADD
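
The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation sketch. It assumes the standard codon assignments (which translation table 11 shares for sense codons) and treats ATG, GTG, and TTG as start codons that are read as methionine in the first position; the helper names `translate` and `gc_fraction` are illustrative and not part of any particular library. Whitespace in the grouped sequence blocks is ignored.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: a subset of commonly used bacterial start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS, reading an initial start codon as M and stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the gene sequence above.
print(translate("GTGAACGCCA CCCCGCCGCA CCCCGGCACC"))  # MNATPPHPGT
```

The printed peptide matches the first ten residues of the protein sequence; note that the initial GTG codon is read as M rather than V. Passing the full gene sequence to `gc_fraction` is one way to compare against the GC content field.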