Gene Francci3_3990 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3990 
Symbol 
ID3906951 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4774857 
End bp4776047 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content72% 
IMG OID637881319 
Producttransposase, IS4 
Protein accessionYP_483069 
Protein GI86742669 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCTGTGG GGGCATTGTC GCGTGACGCG GCCGGTGTGG TGGGAGCCGG CGCGGTGTCG 
CGTGAGGGAA TCTGGCAGCG GCTGCACGCG ATGGGCGAGC ACCGGGGCCG ACGTGGGCGG
GTCTACCCGC TGGCGGTGCT GGCGGCGGTG TGGCTGTGCG CGCTGACCGC GGCGGGCCAT
GACCGGGTCA CCGCCGTGAT CGAATGGCTC GCCGGCACCA CGGAGGAGGA ACGGCGCCGG
CTGCGGCTGC CGTACGACCC GTTTGACGGC TACCGGCTGC CGAGCGAGTC GACGATCCGC
CGGTTCCTCA ACGACACCGA CGACGCGCGG CTCGCCCGCG CACTGCTCGA TCCGCCGCTG
GCCGACCCGG CCCCGCCGAA GCCGGCGTCA CCCGAGGCGG CGGGAGAAGC GGTGCGGGCG
GTCTACGCAC TGGATGGGAA GACCAGCCGC GGCGCGAAGC GCGCCGACGG GCGGCAGGTC
CAACTGGTCG GGGTCGCCGA CCAGGCGACC GGCCGGCTGG TCAACCAGCA CGAGGTGGAC
TCGAAGAGCA ATGAGACGAA GGCGTTCCGC CCTGTCCTCG AGCCCCTCGA CCTTGCCTGC
GACCTGTTGA CCTTCGACGC TCTGCATACC GTGCGCGACC ATCTCGACTG GCTGGTCACC
GACAAGAAGG CCCATTACCT TGCGGTCGTG AAGGGCAACC AGCCAACGCT GCGCGCGTTC
CTGGCCGCGT TGCCGTGGGC GGACGTCCCG GTCGCCGACA CCACCCACGA CCATGGCCAC
GGCCGTGACG AGACCCGCAC GCTGAAGGCC GCGACCGTCG AGCACGCCGA GTTCCCGCAC
GCCCGCCAGG CCGTCCGGAT CCAGCGCTGG CGCCGGGAGA AGGGCAGGAA ACCCAGCCGG
GAGACGGTCT ACGGCATCAC GGATCTGGCC TTCGAGCAGG CCTCGGCCGG GTTCCTCGCC
GACGCCGCCC GCGGCCAGTG GATCATCGAG AACCGTCAGC ACCACGTCCG TGACGTCACT
TTCGGTGAGG ACGCGAGCCG GAGCCGGACC CGCCGCGGCC CGGTCAACCT CGCGATCTTC
CGCGCCACTG TCGCGCACGC GGTCCGCGCC GCGGGCCACC GCTACGTTCC GGCCGGGCGC
CGGGCCTGCA AGACCGCCAC CGCCGCGCTC GACCTGCACG GCTTTCCCTG A
 
Protein sequence
MPVGALSRDA AGVVGAGAVS REGIWQRLHA MGEHRGRRGR VYPLAVLAAV WLCALTAAGH 
DRVTAVIEWL AGTTEEERRR LRLPYDPFDG YRLPSESTIR RFLNDTDDAR LARALLDPPL
ADPAPPKPAS PEAAGEAVRA VYALDGKTSR GAKRADGRQV QLVGVADQAT GRLVNQHEVD
SKSNETKAFR PVLEPLDLAC DLLTFDALHT VRDHLDWLVT DKKAHYLAVV KGNQPTLRAF
LAALPWADVP VADTTHDHGH GRDETRTLKA ATVEHAEFPH ARQAVRIQRW RREKGRKPSR
ETVYGITDLA FEQASAGFLA DAARGQWIIE NRQHHVRDVT FGEDASRSRT RRGPVNLAIF
RATVAHAVRA AGHRYVPAGR RACKTATAAL DLHGFP