Gene Francci3_3037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3037 
Symbol 
ID3904390 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3603626 
End bp3604927 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content71% 
IMG OID637880357 
Producthypothetical protein 
Protein accessionYP_482123 
Protein GI86741723 
COG category[S] Function unknown 
COG ID[COG3391] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02276] 40-residue YVTN family beta-propeller repeat 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.701373 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGAGG GGTCGACAGC GCAGCGCCGC CAGCCGGGGA CCAGGTCCCG GCGGCGTGGT 
CTGATCATGT TGGCCGGGTT GGCGGCCCTG GTGGTCGCAC TGCTGGTGGT GTGGCTGATC
CGGGTCAGGA GCGACCAGGC TACGGGGGCC GGTGCCGCCC GCCCGCCGGG TTCCGTTCCC
GCGCCGGGCC GGGGAGCCGA CTCTGCCCTG CCCCGTGGCG ATGGCGGCGT CGTGGACGTG
TACGCGCACG CGCGTGCCGG GATGCTGAGC CCGGTGGTCC GCGACGATCC GCCTCTCGTG
TACGTCCCGA ACCTGCGCGA CGGCACCGTC ACGGTCATCG ACCAGCGCAC GCTGCGGGTC
GTGGACACCT ACGCCGCCGG CCGCGAGCCT CAGCATGTGG TCCCCTCCTG GGACCTGCGG
ACCCTGTGGG TGAACAACAA CCGCGGCAAC AGCCTGTCAC CGATCGATCC GCGGACCGGT
CGGCCCGCCG GGCCCGCGGT ACCGGTCGCG GACCCGTACA ACCTGTACTT CACCCCCGAC
GGGGCCAACG CGATGGTCAT CGCCGAGGCG AACCACCACA TCGACTTCCG TGATCCGCAC
ACGTTCGCCC TACGGCACAG CCTGGACGTG GGCACGGCCT GCGCGGGCGT CAACCACGTC
GACTTCTCCG TCGACGGCTC CTATGCGATC GCCACCTGCG AGTTCGCCGG ACGACTCGTC
AAGATCGACA TCCCGCGTCA GCGGGTCATC GGATATCTGG ACCTCGGCCG GGACGCCGCT
CCACAGGACA TCAAGATCGA TCCGGCCGGC CGGATCTGGT ACGTGGCGGA CATGAACGCC
GACGGTGTCC ACCTCGTTGA CGGGGATCGC TTCACCAGGG TCGGTTTCGT CCGGACCGGG
CCGGAGACCC ACGGCCTGTA CCCGAGCCGC GACGGCCGAT TCCTGTACGT CGCCAACCGG
GGTGGCCACA TGGACTCCAT GAAACCGCCG TTCCCGCACT CCGGCGACCA GGGCTCGGTC
TCGGTCATCT CCTTCGCCAC CCGGACCGTC GTGGCCACCT GGCCGATCCC CGGTGGGGGA
ACCCCGGACA TGGGCAACGT CGATGCCAAC GGCTCGCGGC TGTGGCTGTC CGGCCGCCGC
AGCAACGTCG TGTACGTGTT CGACACCGGC GGGCCCGGAG GTTCGGAGCC GAAGGCCGGC
CGGCTGCTGG CTCGGATCCC CGTCGGCCGT GAGCCGCACG GTCTCGCGGT CTGGCCGCAA
CCCGGTCGCT ACTCCCTCGG CCATACTGGG ATCATGCGCT GA
 
Protein sequence
MPEGSTAQRR QPGTRSRRRG LIMLAGLAAL VVALLVVWLI RVRSDQATGA GAARPPGSVP 
APGRGADSAL PRGDGGVVDV YAHARAGMLS PVVRDDPPLV YVPNLRDGTV TVIDQRTLRV
VDTYAAGREP QHVVPSWDLR TLWVNNNRGN SLSPIDPRTG RPAGPAVPVA DPYNLYFTPD
GANAMVIAEA NHHIDFRDPH TFALRHSLDV GTACAGVNHV DFSVDGSYAI ATCEFAGRLV
KIDIPRQRVI GYLDLGRDAA PQDIKIDPAG RIWYVADMNA DGVHLVDGDR FTRVGFVRTG
PETHGLYPSR DGRFLYVANR GGHMDSMKPP FPHSGDQGSV SVISFATRTV VATWPIPGGG
TPDMGNVDAN GSRLWLSGRR SNVVYVFDTG GPGGSEPKAG RLLARIPVGR EPHGLAVWPQ
PGRYSLGHTG IMR