Gene Francci3_0439 details

Gene Information

Locus tag: Francci3_0439
Symbol:
ID: 3903628
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 521590
End bp: 523197
Gene Length: 1608 bp
Protein Length: 535 aa
Translation table: 11
GC content: 71%
IMG OID: 637877770
Product: selenocysteine synthase
Protein accession: YP_479554
Protein GI: 86739154
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1921] Selenocysteine synthase [seryl-tRNASer selenium transferase]
TIGRFAM ID: [TIGR00474] seryl-tRNA(sec) selenium transferase
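
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 521590
end_bp = 523197

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with a stop
# codon that is not counted in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length)     # 1608
print(remainder)       # 0
print(protein_length)  # 535
```

Under these assumptions the computed values agree with the listed Gene Length (1608 bp) and Protein Length (535 aa).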


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCCGCT TTGATCATCG AAAGGACGGC CAGAGCCGTG CGGGCAGCGG GCCGATCCTA 
CAGGCACCCA GCAGGGGCAG ACACCCGACA GGCCACGCCG CCGTCCCGGC CAGGTCCGGA
ATCCCACGAG GAGGAGCACT AGCTTCACCG AGAGTGATTG ATCGTACAAG TCTGTTCGCA
TCCCCGGCCC AGGACAGCCC GACGACGCCG GTAGCGACAC CGATGACGCT CGCCCGCGCC
GCCACCGGAC TGCGCCCAGT AATCAACGCT ACCGGCGTCC TGCTCCACGG AGGGCTCGGC
CGAGCTCCCC TGTCCGCGGC CGCCCGCGCG GCCCTCGACC TGGCAGCCGG CACGACCGAC
CTCGAGCTTG ACCTCGGGAC CGGACGGCGA GGACGACGAG GACGTACCGC CCTGGCCGCG
TTGGCAGCCG CGGTGCCCGC GGCGGCGGGC GTGCACATCG TCAACAACAA CGCGGCAGGG
CTGATCCTCG CGGTGACGGC GCTGGCCGGG CGGCGGGAGA TCGTGGTGAG CCGCGGGGAG
CTGGTGGAGA CGGCGGACGG GTTCCGGCTG CCCGAACTGC TGGTCTGCAC CGGGGCTCGA
CTGCGGGAAG TCGGCACCAC CAACCGCACG ACGCTCGGGG ACTACGCCGA CGCGATCGGG
CCGGAGACCG GGCTCGTGCT GCACGTGCAT CCGTCCAGCT ATCAGGTGGT GGGGCTGACT
GACACCCCTG CGATCAACGA TCTTGCCGCG CTCTGCGCCG ATTTCGAGAT CCCCCTCGTC
GGTGACTGCG GCTCCGGGCT GCTACACCCC GAGCCGCTCC TCGCCGATGA ACCCGACGTC
ACGACCTGGC TCGGCTTCGG CGTTAACGTG GTCACGACCA GCGCGGACAA GCTGCTCGGC
GGGCCTCAGT GCGGCCTGCT GCTGGGACGC GCCGACCTGA TCGACCGGAT ACGTCGGCAT
CCGATGGCCC GGGCAATGCG GGTCGGAAAG CTCACACTCG CCGCGCTGGA AGCCACCCTC
AGCCATCCCG ACTCCCCGGT CAGGCAGGCG TTGCACGCCC GACCGGAACG GCTGACAACG
CGGGCCGAGA CGCTCGCCGC GTGGCTACGG CGTTCGGGCA TCGCAGCCTC GGCTGTGGCG
AGTCGTGCGG TCGTTGACGG TGGCGGGGGT GGATCGTCGC CGCTGCCGAA CGACGCCGCA
TTCCGTCGAT CGGGCGGCCA CCGGCGCGAG GGCGCACGAG ACGAAGACGG CATGGCCCCG
GCGGGCCCGG AGAGTGGTCT GACGATCAAA GGGTTCGGCG GCTTCGGCCC GGAGCTGCCA
AGCGCCGCGG TGGCCCTGGA CGCAGTCATC GCCGCCCCCC TGCGCCAGGG CGAGCCGTCC
GTGTTGGGCC GGGTGGAGCA GGGATGCTGC CTGCTCGATC TGCGGACCGT TCCGGTGGAA
CTCGATCCCG TCCTCGCCGC CGCCGTGCTG GCTGCCGCGA CCGAAGCCGG CGCGACCACG
ATCACCGACG CCCCGACCGA GCCGCACATC GCCGAGGATG CCGACGTCAT GACGATCGAA
TCCATGGCCA TCCCGCGCGC CACCGTGCCC GGCCGACGGG CGCCGTGA
 
Protein sequence
MRRFDHRKDG QSRAGSGPIL QAPSRGRHPT GHAAVPARSG IPRGGALASP RVIDRTSLFA 
SPAQDSPTTP VATPMTLARA ATGLRPVINA TGVLLHGGLG RAPLSAAARA ALDLAAGTTD
LELDLGTGRR GRRGRTALAA LAAAVPAAAG VHIVNNNAAG LILAVTALAG RREIVVSRGE
LVETADGFRL PELLVCTGAR LREVGTTNRT TLGDYADAIG PETGLVLHVH PSSYQVVGLT
DTPAINDLAA LCADFEIPLV GDCGSGLLHP EPLLADEPDV TTWLGFGVNV VTTSADKLLG
GPQCGLLLGR ADLIDRIRRH PMARAMRVGK LTLAALEATL SHPDSPVRQA LHARPERLTT
RAETLAAWLR RSGIAASAVA SRAVVDGGGG GSSPLPNDAA FRRSGGHRRE GARDEDGMAP
AGPESGLTIK GFGGFGPELP SAAVALDAVI AAPLRQGEPS VLGRVEQGCC LLDLRTVPVE
LDPVLAAAVL AAATEAGATT ITDAPTEPHI AEDADVMTIE SMAIPRATVP GRRAP
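
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence with translation table 11. The sketch below is illustrative only: it builds the codon table by hand (the amino acid assignments of table 11 match the standard genetic code), does not handle the alternative start codons that table 11 permits, and uses hypothetical helper names (`clean`, `translate`, `gc_fraction`). Only the first and last lines of the gene sequence are used as input to keep the example short.

```python
# Codon table in the conventional TCAG ordering; amino acid assignments
# for translation table 11 are the same as in the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the display and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence shown above.
first_line = "ATGCGCCGCT TTGATCATCG AAAGGACGGC CAGAGCCGTG CGGGCAGCGG GCCGATCCTA"
last_line = "TCCATGGCCA TCCCGCGCGC CACCGTGCCC GGCCGACGGG CGCCGTGA"

print(translate(first_line))  # MRRFDHRKDGQSRAGSGPIL
print(translate(last_line))   # SMAIPRATVPGRRAP
print(gc_fraction(first_line))
```

The two translated fragments match the start and end of the protein sequence listed above. Passing the full 1608-base gene sequence to `gc_fraction` would give the value to compare against the listed GC content; the figure printed here covers only the first 60 bases.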