Gene PCC7424_4571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC7424_4571 
Symbol 
ID7108321 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 7424 
KingdomBacteria 
Replicon accessionNC_011729 
Strand
Start bp5055076 
End bp5056482 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content42% 
IMG OID643482788 
ProductMammalian cell entry related domain protein 
Protein accessionYP_002379801 
Protein GI218441472 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component 
TIGRFAM ID[TIGR00996] virulence factor Mce family protein 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAAGTG GGACAGGGAA TGGAAGATTA TCGCCTTTCA TGCTTCAAAG CGCTGTAGGG 
TTGACAATTT TAGTGGCTTT AGGGTTATTA GGCTGGCTGA TTTTGTGGCT GAGTAATTTT
AGTTTCGGAA ATCGCAGTTA TCGAGCAACC TTTATTTTTC CCAATGCTGG AGGGATGAGT
GTAGGAACGC GAGTCGCTTA CCGAGGGGTG AGAGTGGGGC GGATTGTTTC GATTAATCCT
GAACCGGGAG GGGTGGCCAT TGGGGTAGAA ATTTCACCCG CCGATCGCTT GATTCCTGCC
AACTCATCCA TCGAAGCAGT ACAAGCGGGT TTGGTAGGGG AAACCTCTAT CGATATTATT
CCTGTTGAAG GATTGCCCCC TGAAGGCGTT AAAGCTGACC CCCTCGATCC TAAGTGTAAT
CGGGCGATTA TTATTTGTAA TGGGTCAAGA TTACAAGGGG AGGGGAAATT AGATGTTAAT
GCTTTAATTC GCTCTCTTTT ACGTATTGCT GACGTGATCG ACGATCCCGA ATTTACGGGG
ACTGTTAGAG TCATTGCCCA AAGAACAACA GATGCACTCG GTAATATTAG TAGCTTTAGC
AAAGAGGCAT CGGGGTTAAT TAAAGATACC CGAAGCAACA GAACCATTAA CAGACTTGAT
AATACCTTGC TTTCTATCGA TCAGGCCGCC GATAGTGTTA ATCAAGCCGC CAGTCAGATC
GATCAGGTAA CAGGTGATAT CAATCAGGCC GCCGGTAGTC TTCAACGCTT AGGAAATCAA
GCCTCTAGTT TATTAGAAGA AGCGGAACGG CAAGATACGC TTAAAAATTT AAATTCTACT
TTAGTGTCTC TACAAGGATT TTCTGAACAA ATTCGCGGAT TTATTACCGT TAATCAGGGG
AATATTGGTA ACACTTTAGT TGGACTGGGA GAAACTAGCC AAGAATTAGC CGCGACTTTA
CGCAAGTTAG GCCCGATTTT AACTCAAGTC GAACAAAGTA AATTGGTGAC TAATTTGGAT
ACTATATCCA ATAATACTGC TGCCTTAACC GGGAATTTAC GGGATATTTC CGCTAAACTT
AATGATCCAG CCACAATTGT ACAATTACAA CAAATTCTTG ATGCTGCCCG TGCCGTTTTT
GAAAATGCCA ATAAAATTAC TTCAGATCTC GATGAATTAA CAGGAAATCC CCAGTTTCGG
AGAGATCTTA GGAGGGTAAT TGAAGGATTA AGTAATCTTG TTTCTTCTAG CGAACAACTG
CAACAACAAT TAGAATATGC TCAAGTTTTA AACCGGGTTG AGGCTGAAGT TAATCGAATT
CAGTCTAGGG AAAATATCTC TTTAAAGCCT AAAAATGTAA CTCCTACTCC TATTAAACCG
AAAAATATTA CCCCATCTCA ACCTTAA
 
Protein sequence
MRSGTGNGRL SPFMLQSAVG LTILVALGLL GWLILWLSNF SFGNRSYRAT FIFPNAGGMS 
VGTRVAYRGV RVGRIVSINP EPGGVAIGVE ISPADRLIPA NSSIEAVQAG LVGETSIDII
PVEGLPPEGV KADPLDPKCN RAIIICNGSR LQGEGKLDVN ALIRSLLRIA DVIDDPEFTG
TVRVIAQRTT DALGNISSFS KEASGLIKDT RSNRTINRLD NTLLSIDQAA DSVNQAASQI
DQVTGDINQA AGSLQRLGNQ ASSLLEEAER QDTLKNLNST LVSLQGFSEQ IRGFITVNQG
NIGNTLVGLG ETSQELAATL RKLGPILTQV EQSKLVTNLD TISNNTAALT GNLRDISAKL
NDPATIVQLQ QILDAARAVF ENANKITSDL DELTGNPQFR RDLRRVIEGL SNLVSSSEQL
QQQLEYAQVL NRVEAEVNRI QSRENISLKP KNVTPTPIKP KNITPSQP