Gene Francci3_3900 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3900 
Symbol 
ID3906668 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4664721 
End bp4666742 
Gene Length2022 bp 
Protein Length673 aa 
Translation table11 
GC content73% 
IMG OID637881226 
Producthypothetical protein 
Protein accessionYP_482979 
Protein GI86742579 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1331] Highly conserved protein containing a thioredoxin domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCTAACA AGCTCGCGGA GCAGACGTCT CCGTACCTGC TGCAGCACGC GGACAACCCG 
GTCGACTGGT GGCCCTGGAG CCCGGCGGCC TTCGCCGAGG CGGCCCGGCG GGGGGTGCCG
GTGCTGCTCT CGGTGGGCTA CGCATCCTGC CACTGGTGCC ATGTCATGGC GCACGAATCG
TTCGAGGACG CCGCGACCGC CGAGTACATG AACGACCACT TCGTCAACAT CAAGGTGGAT
CGGGAGGAGC GGCCCGACGT CGACTCCGTC TACATGGACG TCACCGTCGC CCTGACCGGG
CACGGCGGCT GGCCGATGAC GGTGTTCCTC ACCCCTACCG CCGAGCCGTT CTTCGCCGGC
ACCTACTTCC CGCCGCGTCC CCGGCCCGGC ATGGGGTCGT TCCGCCAGGT CCTGACGGCG
GTCACCGAGG CCTGGCGGAC CCGGCGGGAC GAGATCGAGG AGTCGGGGGC GGACATCGCC
CGCCGGCTCG CCGAGGCGGC GACGCGTGGT CCGGCGTCCG GTCTCGCAGC CGAGATCACC
CCGGCTCTGC TCGACACCGC CGTGGCGGGG CTGTCGGCCC GCTTTGACGC CCGTCACGGT
GGATTCGGCG GTGCGCCGAA GTTCCCGCCG TCCATGGTCG CCGAGATGCT GCTGCGGCAC
TCCGCCCGCA CGGGGGACGC GCGCAGCCTG GAGATGGTCG CGGTGACCTG CGAACGGATG
GCGCGTGGGG GCATCTACGA CCAGCTCGCC GGGGGGTTCG CCCGGTACAG CGTCGATGCG
ACGTGGACGG TCCCGCACTT CGAGAAGATG CTCTACGACA ACGCGCTGCT GCTGCGGGTC
TACCTGCACC TGTGGCGGGC CACCGGGTCG GCACTGGCCG AACGGGTGGT CCGGGAGACG
GCAGCCTTCC TGCTTGCGGA TCTGCGCACC CCCCAGGGGG GGTTCGCCTC GGCGCTCGAC
GCGGATGCCG TTCCCGCGGA TGCCGTTCCC GCGAGCGCTG CCCCGGCGGG AGCGCACCCC
GAGGAGGGGG CCAGCTACGC CTGGACACCC GCCCAGTTCG TCGCCGTCCT CGGGCCGGAA
GACGGGCGGT GGGCCGCCGG CGTCTTCGGC GTCACCGAGC AGGGCAGTTT CGAACGTGGT
ACGTCGGTGC TGCGCCTGCC GGCGGACCCG GACGATCCGG CGAGGTTCGC GGCGGTCCGC
GCGGCCCTGG CCGCGGCACG GGCGACCCGG CCGCAGCCGG CTCGCGACGA CAAGGTGGTG
GCCGCCTGGA ACGGCCTGGC CATCGCGGCG TTGGCCGAGG CTGGCGCGTT GTTCGACGAG
CCGGACTGGG TGCGGGCCGC CGAACAGGCC GCCGTCCTGC TCCGGGACGT GCATCTCGTC
AACGGACGCC TGCGCCGGAC GAGCCGGGAC GGGCGGGTTG GCGTGAACGC CGGGGTACTG
GAGGACTACG GCGACGTCGC CGAGGGACTG CTCACCCTGC ACCAGGTCAC CGGCGATCCG
GAGTGGCTCG CGCTCGCTGG GACGCTGCTG GACATCGTCC GTGATCGGTT CGCGGCCTCT
GACGGGGGGT TCTTCGATAC CGCGGACGAC GCGGAGGTGC TGTTGCGCCG GCCGCGGGAC
GACTCGGACT CGGCGACCCC GTCGGGCCAG GCGGCTGTCG CCGGCGCCCT GGTCAGCTAC
GCGGCGTTGA CCGGTTCCAC CGAACACCGG TCGGCCGCCG AGACGACGGT GGCGCGCGTC
GCTCCGTTGC TGGCCCGGGA TGCCCGGTTC GCCGGTTGGG CCGGTGCGGT CGCCGAGGCG
CTGCTGGCCG GGCCGGCCGA GGTCGCGGTC GTGGACAGCC CGGCGCTGGA ACGGCTCGCC
CGCCTCGGCA CGGCGCCCGG TGCGGTCGTG GTGACCAGCG GACCGCTGAC CGTCGGCCGG
GAAACGGCTG GGGTGTACGT GTGCCGGGAT TTCGTCTGTG AGCTGCCGGC CCGGACCGCA
GCCGAGGTGC GCCGCCAACT CGGGGTGAAG GTGGCCAGCT GA
 
Protein sequence
MPNKLAEQTS PYLLQHADNP VDWWPWSPAA FAEAARRGVP VLLSVGYASC HWCHVMAHES 
FEDAATAEYM NDHFVNIKVD REERPDVDSV YMDVTVALTG HGGWPMTVFL TPTAEPFFAG
TYFPPRPRPG MGSFRQVLTA VTEAWRTRRD EIEESGADIA RRLAEAATRG PASGLAAEIT
PALLDTAVAG LSARFDARHG GFGGAPKFPP SMVAEMLLRH SARTGDARSL EMVAVTCERM
ARGGIYDQLA GGFARYSVDA TWTVPHFEKM LYDNALLLRV YLHLWRATGS ALAERVVRET
AAFLLADLRT PQGGFASALD ADAVPADAVP ASAAPAGAHP EEGASYAWTP AQFVAVLGPE
DGRWAAGVFG VTEQGSFERG TSVLRLPADP DDPARFAAVR AALAAARATR PQPARDDKVV
AAWNGLAIAA LAEAGALFDE PDWVRAAEQA AVLLRDVHLV NGRLRRTSRD GRVGVNAGVL
EDYGDVAEGL LTLHQVTGDP EWLALAGTLL DIVRDRFAAS DGGFFDTADD AEVLLRRPRD
DSDSATPSGQ AAVAGALVSY AALTGSTEHR SAAETTVARV APLLARDARF AGWAGAVAEA
LLAGPAEVAV VDSPALERLA RLGTAPGAVV VTSGPLTVGR ETAGVYVCRD FVCELPARTA
AEVRRQLGVK VAS