Gene Francci3_1152 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1152 
Symbol 
ID3903580 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1371157 
End bp1372689 
Gene Length1533 bp 
Protein Length510 aa 
Translation table11 
GC content72% 
IMG OID637878484 
ProductNmrA-like 
Protein accessionYP_480260 
Protein GI86739860 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0702] Predicted nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCATCC TCGTCACCGG GGCGACCGGG TACATCGGCG GCCGGCTGGC TCCCCGCCTG 
CTCGACCGTG GCCACCACGT CCGGGTCATG ACCCGGGATC CCGTCCGGCT GCGGGACATC
CCGTGGGCCG TCCGGGCGGA GGTCGTCCGC GCCGACGCCC GGGATCCGGA GTCGTTGCGG
TCCGCACTCG ACGGGATCGA GGTCGCCTAC TACCTCATCC ACTCGATCGA CAGTGGTGGT
GACTTCTCCG CGGTCGACCG GCGGGCGGCG AACGCCTTCG CCGCCGCGGC CCGGGCGGCC
GATGTCCGCA GGATCGTCTA CCTCGGCGGG CTGGGCCCGG CGACGACGGG TGGCACGTCC
GCCCATCTGA GCTCCCGCCA GGAGGTCGGG AGGATCCTGC TCGGCTCGGG GGTTCCGACG
ATCGTGCTGC GTGCGGCGGT CGTCATCGGC AGCGGCAGCG CCTCGTTCGA GATGCTGCGC
TACCTGACGG AGCGCCTGCC GGTGATGCTG ACGCCGCGCT GGGTGCGGAC CAGGATCCAG
CCAATCGCCG TCCGGGACGT GCTGCACTAT CTGATCGGCG CTCTGGACGT CCCCGACGAC
GTCGAGGGCA CCTTCGACGT GGGCGGCCCG GACATCCTGA CCTATGCGGA GATGATGCAG
CGCTTCGCCG CCATCGAGGG GCTGCGGCGC CGGATCATCG TCCCGGTTCC GATCCTGTCC
CCCGAACTGT CCTCGCTGTG GGTCGGGCTC GTCACCCCGG TGCCGGGCGG CATCGCGAGG
CCGCTCATCC GTTCCTTGCG CAACGAGGTG GTGGTCCACG ACCACCAGGT CGCCCGCTGG
ATCCCCGATC CTGCCGAGGG ACTGCTGCCC TTCGACGCCG CGGTGGCCCT CGCGCTGGCC
CGGGTGCGGG CGCGTTCGGT GAAGACCCGA TGGTCGACGG CGGTGTGGCC GGGTTCGGGA
GGCGCATCGG ACGACGGGGA CGGCGCCGGT GCCACGCACC CCCCGAACGA GCCCCTGCCG
ACCGACCCGC AGTGGGCCGG GGGATCCCTC TACGTCGACG AGCGCAGCAT GGCCGTGGCG
GCGCCGCCGG CCTGTCTGTG GCACGTCATC GAGGGCATCG GGGGCGACAA CGGCTGGTAC
TCGTGGCCCC TGGCCTGGTC CGCCCGCGGC TGGCTGGACA CCGCCCTCGG TGGCGTCGGT
CTGCGCCGGG GTCGGCGTGA TCCGCAGCGG GTGCACGTCG GGGAGGCGCT GGACTTCTGG
CGGGTCGAGG AGATCGAACC GGGCCACCTG CTCCGGCTGC GTGCCGAGAT GAAGCTGCCT
GGCGAGGCAT GGCTGGAACT ACGCTCGATG GTGGACTCCG AGGGCACCAC GACCTACTCG
CAGCGCGCGA GTTTCCTGCC CCGCGGACTT CCCGGGCAGC TCTACTGGTG GTCGGTCAGC
CCGTTCCACG CGGCCGTCTT CGGTGGAATG CTGCGCAACA TCGTGCGCAA GGCCGAGGAC
GAATGGGCCG CCCGCGCCGT CGCCACCGCC TGA
 
Protein sequence
MRILVTGATG YIGGRLAPRL LDRGHHVRVM TRDPVRLRDI PWAVRAEVVR ADARDPESLR 
SALDGIEVAY YLIHSIDSGG DFSAVDRRAA NAFAAAARAA DVRRIVYLGG LGPATTGGTS
AHLSSRQEVG RILLGSGVPT IVLRAAVVIG SGSASFEMLR YLTERLPVML TPRWVRTRIQ
PIAVRDVLHY LIGALDVPDD VEGTFDVGGP DILTYAEMMQ RFAAIEGLRR RIIVPVPILS
PELSSLWVGL VTPVPGGIAR PLIRSLRNEV VVHDHQVARW IPDPAEGLLP FDAAVALALA
RVRARSVKTR WSTAVWPGSG GASDDGDGAG ATHPPNEPLP TDPQWAGGSL YVDERSMAVA
APPACLWHVI EGIGGDNGWY SWPLAWSARG WLDTALGGVG LRRGRRDPQR VHVGEALDFW
RVEEIEPGHL LRLRAEMKLP GEAWLELRSM VDSEGTTTYS QRASFLPRGL PGQLYWWSVS
PFHAAVFGGM LRNIVRKAED EWAARAVATA